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Preface 


ES 


Application of computer-oriented numerical methods has become an in- 
tegral part of the life of all the modern engineers and scientists. The 
advent of powerful small computers and workstations has tremendously 
increased the speed, power and flexibility of numerical computing. 
Recognizing the importance of computers and numerical computing, all 
the universities now offer courses in both computing and numerical 
mothods in their engineering curriculum. They are also trying to inte- 
grate computer-oriented numerical analysis into other courses such as 
mathematics and computer science. It is believed that students who can 
understand, enjoy, and successfully apply the methods of numerical com- 
puting to solve problems make better logicians and thereby better engi- 
neers and scientists. 

‘The contents of this book are der. гей from a study of curricula offered 
by various universities across the country. The book covers both the 
introductory topics and the more advanced topics such as eigenvalue 
problems and partial differential equations. The primary goal is to pro- 
vide students with a sound introduction of numerical methods as well as 
to make the learning a pleasurable experience. Logical arrangement of 
topics, clarity of presentation, and illustration through examples aid the 
student to become more and more adept in applying the methods. 

‘There are a number of books in this field. Why another book? This 
book is uniquely different from many other books in a number of ways. 
The salient features are: 

= Presents a detailed account of process and characteristics of numer- 

ical computing 

п Discusses the concept of computing using modern computers. 

в Places emphasis on the nature of computer arithmetic which is 

different from real arithmetic in а number of ways. 

я "Phe concept, cause and consequence of errors in the application of 

numerical computing have been highlighted. 
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m An overview of FORTRAN 77/90 has been given which would serve 
as an introduction to the beginners and as a reference to those who 
have already learned this language. 

= Mathematical derivation of each method is given to build the read- 
er's understanding of numerical analysis. 

m Wherever possible, algorithms of computing аго given in pseudocode 
using boxes. 

® A variety of solved examples that would sharpen the skills in both 
the theory and application of numerical computing are provided, 

m Computer programs for almost all numerical methods discussed 
have been presented in both FORTRAN 77 and ANSI C languages. 

= А separate diskette version of the book is available; the diskette 
contains the source codes of programs in FORTRAN 77 and C lan- 

ages. 

. | eae are designed using the modern modular and structured 

programming concepts. These modules can be used as building 

blocks in other programs, 

Error analysis of each method is presented. 

Each chapter begins with a statement of need and scope giving a 

preview of what is coming later, 

Each chapter ends with a summary and a set of key terms that 

remind the reader what has been covered and discussed. 

m Review questions provide an opportunity to the reader to test his 
understanding of the concepts. 

m Numerous end-of-chapter practice exercises are an important sup- 
plement to the text. They would not only improve the understand 
ing of algorithms but also enhance the application skills, 

a Programming projects give students ample opportunities to prac- 
tice their skills of scientific programming. 

I have exerted a conscious effort to make the book student-oriented 
and student-friendly. 1 hope the students would find the book not only 
interesting but also useful. 

I wish to acknowledge the many helpful suggestions of reviewers 
which have certainly improved both the content and quality of the mate- 
rial. It is the persuasion and encouragement of Dr N Subrahmanyam 
and Vibha Mahajan of Tata McGraw-Hill that has made the publication 
of this book possible. Thanks are due to K Balakrishnan and J R Pratibha 
whose excellent word processing skills made the preparation of the book 
in its present form much easier and possible. 

The book is full of equations and formulas that contain a large num- 
ber of variables, subscripts, superscripts, etc. І shall be grateful, if the 
readers could communicate to me any errors they discover. 

1 dedicate this book to my wife Sushila who is the inspiration in all 
my work. 


E BALAGURUSAMY 
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INTRODUCTION 


ons play nn indispensable role in solving real life 
mathematical, physical and engineering problems. They have been in 
use for centuries even before digital computers appeared on the scéne. 
Great mathematicians like Gauss, Newton, Lagrange, Fourier and many 
others in the eighteenth and nineteenth centuries developed numerical 
techniques which are still widely used. The advent of digital computers 
has, however, enhanced the speed and accuracy of numerical compu- 
tations, 

What is numerical computing? It is important to understand the answer 
to this fundamental question before we proceed further. Numerical 
computing is an approach for solving complex mathematical problems 
using only simple arithmeti operations. The approach involves 
formulation of mathematical models of physical situations that can be 
Solved with arithmetic operations \It requires development, analysis and 
useofalgorihms.--- — — 

Numerical computations invariably involve a large number of 
arithmetic calculations and, therefore, require fast and efficient computing 
devices. The microelectronics revolution and the subsequent development 
of high power, low cost personal computers have had a profound impact 
on the application of numerical computing methods to solve scientific 
problems, 

The traditional numerical computing methods usually deal with the 
following topics: 

1. finding roots of equations 
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. solving systems of linear algebraic equations 
. interpolation and regression analysis 
numerical integration 

numerical differentiation 

. solution of differential equations 

‚ boundary value problems 

, solution of matrix problems 

In this book we will discuss some of the popular methods available in 
each of these areas. 


ЁТ NUMERIC DATA 


Numerical computing may involve two types of data, namely, discrete 
date and continuous data. Data that are obtained by counting are called 
discrete data. Examples of discrete data are the total number of items in 
a box, or the total number of people participating in a race. 

Data that are obtained through measurement are called continuous 
data. Examples of continuus data are the specd of a vehicle as given by 
a speedometer, or temperature of a patient as measured by à 
thermometer. 
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Analog refers to the principle of solving а problem by-using-2 tool which 
operates in a way analogous to the problem For example, the electronic 
circuits in an-analog-computer act analogously to the problem to-be 
solvi alog computing is based on inputs that vary continuously, 
such as ni, voltage or temperature: The earliest computers were 
analog and functioned on the basis ol fectrical voltages. Calculations 
were performed by adding, subtracting, multiplying and dividing voltages. 
Analog computers are fast, but their accuracy is limited by the precision 
with which the physical quantities can be read. 

Many real life measurable quantities are analog in nature: time, tem- 
perature, pressure, and speed, for instance. Analog methods are pre- 
ferred when these quantities have to be represented in a calculation. An 
example of application of analog computers is a machine used in a postal 
department to convert the weight of a package into the cost of postage 
needed for mailing. 

The basic requirement in the application of analog computers is the 
writing down of differential equations describing the physical system of 
interest, Given the differential equations, the analog result may be 
obtained either by direct method, in which equivalent electrical circuits 
are directly used to. simulate the time variations of the dependent 
variables of the physical system, or the functional method, in which 
electronic circuits perform the mathematical operations indicated by the 
terms of the differential equation. 
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F БЕЙ DIGITAL COMPUTING 


A digital computer is a computing device that operates on inputs which 
are discrete in nature. The input data are numbers (or digits) that may 
represent numerals, letters, or other special symbols. Just as a digital 
clock directly counts the seconds and minutes in an hour, a digital 
computer counts discrete data values to compute the results. 

‘Today's digital computers can cope with the analog information, but 
they have to convert it into digital form. They do this by measuring the 
value of analog quantity at regular intervals and converting that 
measurement into a number of electrical pulses corresponding to that 
measurement. [n an analog watch, for example, time and hands on the 
watch face change continuously; a digital watch, however, converts the 
passage of time into tiny intervals, marked by the numbers changing on 
the dial. 

Digital computers are more accurate than analog computers. Analog 
computers may be accurate to within 0.1 per cent of the correct value, 
whereas digital computers can obtain whatever degree of accuracy is 
required by choosing the correct number of decimal places, They are 
designed to read, store, manage, and output specific units like numbers, 
letters, or punctuation marks. Digital computers are widely used for 
many different applications and are often called general purpose 
computers. 


ROCESS OF NUMERICAL COMPUTING 


v 
Аз ted carlier, numerica! computing involves formulation of 
mathematical models of physical problems that can be solved using basic 
arithmetic operations. The process of numerical computing can be roughly 
divided into the following four phases which are illustrated in Fig. 1.1: 

1. formulation of a mathematical model 

2. construction of an appropriate numerical method 

3. implementation of the method to obtain a solution 

4. validation of the solution 

The formulation of a suitable mathematical model is critical to the 
solution of the ргоМет)А mathematical model can be broadly defined as 
a formülation of certain mathematical equation that expresses the 
essential features of a physical system or process. Models may range 
from a simple algebraic equation to a complex set of differential equations. 
Figure 1.2 shows various types of mathematical equations that might 
result while formulating mathematical models of physical processes. 

The formulation of a mathematical model begins with a statement of 
the problem and the associated factors to be considered, The factors may 
concern the balance of forces and other laws of conservation in physics. 
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Real life problems have many uncertainties and unknowns. It might, 
therefore, be necessary to make certain assumptions for approximating 
and to include only those features of the problem that are considered 
critical to the final solution. An over-simplified model may have only 
limited usefulness. The model may be enhanced later, if necessary. The 
model refinement may make the solution procedure more difficult. We 
must always maintain the balance of enhancement of the model and 
accuracy of the solution required. 

Once a mathematical model is available, our first step would be to try 
to obtain an explicit analytical solution. In most cases, the mathematical 
models may not be amenable to analytical solutions or they may not be 
solved efficiently using analytical techniques. In such cases, we have to 
construct appropriate numerical methods to solve mathematical models. 
As mentioned earlier, a numerical method is a computational technique 
which involves only a finite number of basic arithmetic operations. 

For a given problem, there might be several alternative numerical 
methods. We must consider different factors or trade-offs before selecting 
a particular method—such as type of equation, type of computer available, 
accuracy, speed of execution, and programming and maintenance efforts 
required. 

Modelling is the process of translating a physical problem into a 
mathematical problem. The process involves 

1. making a number of simplifying assumptions 

2. identification of important variables 

3. postulation of relationships between the variables 

This book is mainly concerned with the solution of mathematical models 
using numerical techniques. 


kc ани a mathematical model for predicting the population growth of 


Assumptions: 
Birth and death rates are proportional to population and time interval. 
Parameters: 


P(t) —population at time £ 
AP —increase in population in time interval At 
Then, 
AP = births in А — deaths in At 
= C, Ри) At - CPC) М 
-2(C,-C)PDA ' 


Growth rate 


АР 
—=CP(t 
A ч“) 
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Taking the limits At 0, we get, 
dp 
Ж” C Pt) 
Solution of this differential equation is 
Ри) = Pye 


where P; is the population at time t = 0. 


‘The population growth depends on the growth constant C = C, - C; The 
population will be stable if С, 


The third phase of the numerical computing process is the 
implementation of the method selected. This phase is concerned with 
the following three tasks 

1. design of an algorithm 

2. writing of a program 

3. executing it on a computer to obtain the results 
Once we are able to obtain the results, the next step is the validation of 
the process. Validation means the verification of the results to see that it 
is within the desired limits of accuracy. If it is not, then we must go back 
and check each of the following: 

1, mathematical model itself 

2. numerical method selected 

3. computational algorithm used to implement the method 

This may mean modification of the model, selection of an alternate 
numerical method or improving the algorithm (or a combination of them), 
Once a modification is introduced, the cycle begins again. Figure 1.3 
illustrates how the numerical computing cycle moves from the real world 
to mathematical world and back. 


Real world Mathematical моні 
1 
Physical problem Mathematical 
[= Г problem or model: | 


Validation of model = Solution E 


| 


Fig. 1.3 Another way of looking at ће computing process « 
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CHARACTERISTICS OF NUMERICAL COMPUTING 


methods exhibit certain computational characteristics during 
lementation. It is important to consider these characteristica 


characteristics that are critical to the success of implementation are: 
accuracy, rate of convergence, numerical stability, and efficiency. 


/Accuracy 


Every method of numerical computing introduces errors. They may be 
either due to using an approximation in place of an exact mathematical 


mathematical model was built. Choice of a method is, therefore, very 
much dependent on the particular problem. The general nature of these 
errors will be discussed in detail in Chapter 4. 


Ө of Convergence, 


Many numerical methods are based on the idea of an iterative process. 
This process involves generation of a sequence of approximations with 


discussed in Chapter 4. They are also discussed in various places where 
specific methods are analysed for convergence. 


umerical Stability 


Another problem introduced by some numerical computing methods is 
that of numerical instability. Errors introduced into a computation, from 
whatever source, Propagate in different ways. In some cases, these errors 
tend to grow exponentially, with disastrous computational results. A 
computing process that exhibits such exponential error growth is said to 
be numerically unstable. We must choose methods that are not only fast 
but also stable, 

Numerical instability may also arise due to ill-conditioned problems. 
There are many problems which are inherently sensitive to round off 
errors and other uncertainties, Thus, we must distinguish between sen- 
sitivity of methods and sensitivity inherent in problems. 

When the problem is ili-conditioned, there is nothing we can do to 
make a method to become numerically stable. 
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One more consideration in choosirg a numerical method for solution of a 
mathematical model is e/ficiency. It means the amount of effort required 
by both human and computer to implement the method. A method that 
requires less of computing time and less of programming effort and yet 
achieves the desired accuracy is always preferred. 


EE COMPUTATIONAL ENVIRONMENT 


The last phase of the numerical computing process, namely the 
implementation phase, requires resources such as computer hardware, 
operating system and other systems software, language compilers, actual 
application programs and other software tools *o manipulate data and 
provide output in a desired form. 

The computer hardware may range from a small personal computer 
to a lurge super computer depending on the nature and size of the 
problem, A program may not always produce the same results on two 
different types of computers due to difference in their round off errors. 

Appropriate operating systems and compilers play an important role 
jn developing portable programs. UNIX and MS-DOS have become 
popular operating systems for scientific computing. FORTRAN language 
has dominated the scientific computing field for the last four decades 
and it is expected to continue its predominant role for some more years. 
It has been continuously modified and extended to support the ever 
changing requirements of software engineering. The likely strong 
competitor for FORTRAN in the near future will be C and C++ languages 
which contain some unique features and powerful control structures. 
Portability is another strong point of these languages. 


| NEW TRENDS IN NUMERICAL COMPUTING 


In recent years, the increasing power of computer hardware has affected 
the approach of numerical computing in several ways. It has forced 
scientists and engineers to search for algorithms that are computationally 
fast and efficient. An important new trend is the construction of 
algorithms to take advantage of specialised computer hardware such as 
vector computers and parallel compuiers. Another trend is the use of 
sophisticated interactive graphics, in which the user can view the results 
graphically and advise the computer, graphically, on how to proceed 
further. 

One important development which is likely to have an increasing 
impact on scientific computing 18 symbolic computation. Symbolic 
computation systems would enable us to add, multiply and divide 
polynomials or rational expressions the same way we would do using 
pencil and paper. They can also solve certain mathematical problems 
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without rounding off errors. Symbolic computation is expected to play an 
increasing role in scientific computation. 

Object-oriented numerical computing is gaining importance due to 
the popularity of languages like С++ and Java. They incorporate concepts 
such as encapsulation, inheritance, polymorphism and operator 
overloading. They support the idea that program units should interact 
with one another only through clearly defined interfaces. They also enable 
the extension (or reuse) of the existing code without modifying it. 


7 |] MATHEMATICAL BACKGROUND 


This book assumes that the readers have some mathematical background. 
They require basic knowledge of algebra, functions, matrices, and integral 
and differential calculus. 


[faa SUMMARY 


In this chapter, we have introduced the concept of numerical computing 
and discussed the steps involved in solving a physical problem using 
numerical methods. We also discussed the characteristics of numerical 
computing and computing resources required for implementing a 
numerical method. 


Accuracy lterative process 


Algorithm Mathematical model 
Analog computer Numerical computing 
[4 Numerical method 
CH Numerical stability 
Continuous data Рагайе! computers 
Digital computer Rate of convergence 
Discrate data Round off error 
Efficiency Symbolic computation 
FORTRAN Truncation error 
General purpose computers Validation 
1-сопайіопеа problems Vector computer. 


1. What is Numerical Computing? 
2. Distinguish between analog computing and digital computing. 
3,7 Describe, with the help of a block diagram, the process of numeri- 
cal computing. 
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4. Newton’s second law of motion states that the time rate of change 
of momentum of a body is equal to the resultant force acting on it. 
Using this law, formulate a mathematical model to determine the 
terminal velocity of a free falling body near the earth's surface. 

5. The Newton's law of cooling states that the rate of heat from a 
liquid is proportional to the difference in temperatures between the 
liquid and the surroundings. Formulate a mathematical model to 
govern this law. 

6. When a boat moves through water, the retarding force is proportional 
to the square of the velocity. Formulate a differential equation in 
terms of velocity given the mass m and the drag coefficient k. 

f: State the four characteristics of numerical computing. 

8. What is accuracy? How is it affected during the process of numerical 

computing? 
^ What is convergence? How is it important in numerical computing? 
‚ What do you mean by numerical instability? 

11. Distinguish between sensitivity of methods and Sensitivity of 
problems, 

12. Describe resources required for implementing a numerical computing 
process, 


Introduction to 
Computers and 
Computing Concepts 


INTRODUCTION 


“Aa Chapter 1, we discussed that numerical computing requires two 
important tools, namely, mathematical methods and computers. Most 
numerical methods cannot be solved without the help of computers. 
Therefore, a background knowledge of computers and computing concepts 
will enhance the understanding of implementation of numerical 
computing solutions. This chapter provides some basic information on 
computing en ironment and problem solving approach using computers, 

The spate of innovations and inventions 1n computer technology during 
the last two decades has led to the development of a variety of personal 
computers. They are во versatile that they have become indispensable to 
engineers, scientists, business executives, managers, administrators, 
accountants, teachers and students. They have strengthened humankind’s 
powers in numerical computations and information processing. 

Modern computers possess certain characteristics and abilities pecu- 
liar to them. They can 

1. perform complex and repetitive calculations rapidly and accurately 

2. store large amounts of data and information for subsequent 

manipulations 
3. hold a program of a model which can be explored in many different 
ways 
4, make decisions 
5. provide information to the user 
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3. automatically correct or modify certain parameters of a system 

under control / 

7. draw and print graphs 

3. converse with users interactively 

Lngineers and scientists make use of the high-speed computing 
capability of computers to solve complex mathematical models and design 
problems. Many calculations that were previously beyond contemplation 
have now become possible. But for computers, many of the technological 
achievements, such as landing on the moon, would not have been possible. 

Computers have helped automation of many industrial and business 
systems, They are used extensively in manufacturing and processing 
industries, power distribution systems, airline reservation systems, 
transportation systems, banking systems, and so on. Computer-aided 
design (CAD) and computer-aided manufacture (CAM) are among the 
most popular industrial applications today. 

Modelling and simulation is another area where computers are 
increasingly used. This has greatly accelerated research in such areas ав 
physical and social sciences, medicine, astronomy and meteorology. 

Business and commercial organisations need to store and maintain 
voluminous records and use them for various purposes such as inventory 
control, sales analysis, payroll accounting, resources scheduling and 
generation of management. reports. Computers can store and maintain 
files and can sort, merge or update them as and when necessary. 

The ability of computers to store large amounts of data has led to 
their application in libraries, documentation centres, employment 
exchanges, police departments, hospitals and other similar establishments. 
Computers are used in international games such as the Olympics to 
keep irack of events and provide timely and reliable information and 
documentation to all concerned. 

Since computers can bank a variety of information and converse with 
the users, they are being used as resources in teaching and learning at 
all levels of education and training. This process is known as computer- 
assisted learning (CAL). Here, learners can communicate directly with а 
computer in а conversational mode. Using this mode, a learner can learn 
a topic in his own time and pace. 

Computers are also used to manage the learning processes. This is 
called computer-managed learning (CML). Computers can store students’ 
responses, evaluate their performance and then direct them to the next. 
appropriate learning unit. 

‘The areas of computer applications are too numerous to mention. 
Computers have become an integral part of our everyday life. They 
continue to grow and open new horizons of discovery and application 
such as the electronic office, electronic commerce, and the home computer 
centre. 

The microelectronics revolution has placed enormous computational 
power within the reach of every scientist and engineer. However, it 
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must be remembered that compüters are machines created and managed 
by humans. A computer has no brain of its own. Anything it does is the 
result of human instructions. It is an obedient slave which carries out 
the master’s orders as long as it can understand them, no matter whether 
they are right or wrong. In short, computers lack common sense. These 
instructions constitute the program or software. 


EVOLUTION OF NUMERICAL COMPUTING AND 
COMPUTERS 


The use of computing techniques is over 5000 years old. The Babylonians, 
Chinese, and Egyptians used numerical methods for the survey of lands 
and the collection of taxes as early as 3000 BC. Computing history starts 
with the development of a device called the abacus by the Chinese around 
this period. This was used for the systematic calculation of arithmetic 
operations. Since then the number system has undergone various changes 
and has been used in different forms in computing. The most significant 
development in computing was the formulation of the decimal number 
system in India around 800 AD. Another significant development was 
the invention of logarithm by John Napier in 1614, which made computing 
simple. 

The modern age of mathematics emerged during the 17th century 
when Johannes Kepler and Galileo Galilee deduced the laws for planetary 
motion and Sir Isaac Newton formulated the law of gravity. The 
subsequent developments in mathematics and other sciences increased 
the necd for new computing techniques and devices. 

The principle of logarithm was later applied to a calculating device 
known as the slide rule, which was extensively used till recently. The 
first accounting machine was built in France by Blaise Pascal in 1642. 
Then came the Leibnitz calculator in 1671 designed by Gottfried Wilhelm 
von Leibnitz. These machines progressed in technology and variety and 
became the standard calculating machines of the business community. 
During the beginning of the 19th century, Joseph Marie Jacquard 
invented an automated loom operated by a mechanism controlled by 
punched cards. 

The origin of the modern computer can be traced back to 1834 AD, 
when an English mathematician, Charles Babbage, designed an analytical 
engine. This is considered to be the first programmable digital mechanical 
computer. However, this kind of machine was not built until 1944, when 
Mark I, an electromechanical automatic computer, was developed by 
IBM. Subsequently, a series of technological improvements and 
innovations took place and the design of computets underwent continuous 
and dramatic changes. Some of the important developments since the 
slide rule are given in Table 2.1. 
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Table 21 Some developments in computing technology 
Device 


1622 Slide rule 

1642 Pascal caleulator, an accounting machine by Blaise Pascal 

1671 Leibnitz calculator 

1801 Punched card loom by Jacquard 

1822 Difference engine by Charles Babbage 

1834 Analytical engine by Charles Babbage 

1890 Punched card machine by Herman Hollerith 

1930 Differential analyser by Vannevar Bush 

1936 ^ Paper on computational numbers by Alan Turing 
Link between symbolic logic and electric circuit by Claude Shanon 

1937 Binary adder built by George Stibitz 

1941 First general-purpose computer designed by Konrad Zuse 

1943 Colossus machine built to crack German secret codes, by the British 

1944 First automatic computer, MARK 1, designed by Howard Aiken 

1945 Critical elements of a computer system outlined by John Von 
Neumann 

1946  . First electronic digital computer, ENIAC, put to operation by 
Presper Eckert and John Mauchly 

1947 Transistor invented by John Bardeen, William Shockley and 
Walter Brattain 

1951 First business computer, UNIVAC, became operational 

1956 Second generation computer (using transistors) introduced by 
Bell Laboratory 

1959 ^ Integrated circuits (ICs) demonstrated by Clair Kilby 

1964 First third generation computer using ICs developed 

1965 First commercial minicomputer, PDP-8, introduced by Digital 
Equipment Corporation 

1971 Intel 4004 microprocessor designed by "Ted Hoff 

1974 First fourth generation computer (using microprocessors) built by 
Ed Roberts 

1975 First personal computer software created by Bill Gates and Paul 
Allen 

1977 Apple introduced its famous personal computer 

1981 IBM PC introduced in the market 

1982 ` Cray supercomputer marketed by Cray Research Company 

‚ 1980 Optical computer demonstrated 4 


Modem Computers " 


The era of modern computers began in 1951 when the UNIVAC (Universal 
Automatic Compüter) became operational at the Bureau of Census in 
USA. Since then, computers started appearing i quick succession, each 
claiming an improvement over the бїһег. They represented improvements 
in speed, memory (storage) systéms, input and output devices and 
programming techniques. They also showed a continuous reduction in 
physical size and cost. The developments in computers are closely 
associated with the developments in material technology, particularly 
the semiconductor technology. 
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Computers developed after ENIAC have been classified into the fol- 
lowing four generations: 


First generation 1946 ~ 1955 
Second generation 1956 — 1965 
Third generation 1966 – 1975 
Fourth generation 1976 — present 


You may notice that from 1946, each decade has contributed one 
generation of computers. 

In the first generation computers vacuum tubes were used. Magnetic 
tape drives and magnetic core memories were developed during this 
period. The first generation computers possessed the following drawbacks 
ав compared to the later models: 

1. large in size 

2. slow operating speeds 

3. restricted computing capacity 

4. limited programming capabilities 

5. short life span 

6. complex maintenance schedules 

The second generation computers were marked by the use of a solid- 
state device, called the transistor, in the place of vacuum tubes. These 
machines were much faster and more reliable than their earlier 
counterparts. Further, they occupied less space, required less power, 
and produced much less heat. 

Research in the field of electronics led {0 the innovation of the 
integrated circuits, now popularly known as IC Chips. The use of IC 
chips in the place of transistors gave. birth to the third generation 
computers. They were still more compact, faster and leas expensive than 
tho previous generation. 

Along with the third generation computers, newer and faster 


Fifth Generation Computers 

Japan and many other countries are working on systems that are known 
as knowledge-based or expert systems which will considerably improve 
the man-machine interaction. Such systems would integrate the 
advancements in both hardware and software technologies and would 
facilitate computer-aided problem-solving with the help of organised 
information in many specialised areas. d 
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Table 2.2 Computer generations 


E, Generation 


'eatures = - 
First Second Third Fourth 
Main component Vacuum tube Transistor Integrated LSI and VLST 
circuit. circuit 
(IC Chips). 
Internal storage Electrostatic Magnetic Magnetic Semiconductor 
(Memory) tubes core core memory 
Magnetic 
drum 
External Paper tape Magnetic disk Magnetic disk Magnetic disk 
storage Punched card. Magnetic Magnetic tape Magnetic tape 
(Auxiliary Magnetie баре drum Magnetic drum Magnetic drum 
memory) Magnetic tape Punched card Floppy disk 


Paper tape — Papertape С rom 
Punched card 30,0000 3,00,000 to 

Speed of operation 40 to 300 3,000 to 30,000 3,00,000 30,00,000 

(Additions/second) thousands ^ thousands thousands thousands 


This generation of computers is called the fifth generation computers. 
Although knowledge-based systems are expensive and time-consuming 
to build, they are likely to become more popular in the coming years. 


[887] TYPES OF COMPUTERS 


Computers may be classified based on operating principles, size and 
capability, and applications. 


Principles of Operation 


Based on the operating principles, computers can be classified into any 
one of the following types: digital computers, analog computers, and 
hybrid computers. 

Digital computers operate essentially by counting. All quantities are 
expressed as discrete digits or numbers. Digital computers are useful for 
evaluating arithmetic expressions and for manipulations of data (such 
as preparation of bills, ledgers, solution of simultaneous equations, etc.). 

Analog computers operate by measuring rather than by counting. The 
name, which is derived from the Greek word analog, denotes that the 
computer functions by establishing similarities between two quantities 
that are usually expressed as voltages or currents. Analog computers 
are powerful tools to solve differential equations. Computers which 
combine features of both analog and digital types are called hybrid com- 
puters. 
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A majority of the computers used today are digital. As their name 
suggests, digital computers were originally designed to perform certain 
numerical calculations. They gradually replaced almost. all mechanical 

ulating devices. Later, the concept of stored programs enabled them 
to ‘store data and instructions and perform certain sequences and 
combinations of arithmetic operations automatically. This has led to the 
use of digital computers in a variety of applications. 


Applications 


Modern computers, depending upon their applications, are classified as 
special purpose computers or general purpose computers. 

Special purpose computers are tailor-made to cater solely to the 
requirements of a particular task or application. They incorporate the 
instructions needed into the design of internal storage so that they can 
perform the given task on a simple command. They, therefore, do not 
possess unnecessary options and cost less. 

On the other hand, general purpose computers are designed to meet. 
the needs of many different applications. In a general purpose computer, 
the instructions needed to perform a pariicular task are по! wired 
permanently into the internal memory. When one job is over, instructions 
for another job can be loaded into the internal memory for processing. 
Thus, a general-purpose machine can be used to prepare pay-bills, manage 
inventories, print sales reports, and so on 


Size and Capability 


Computers are also available in different sizes and with different 
capabilities. Broadly, they may be categorised ns microcomputers, 
minicomputers, mainframes und supercomputers. The selection of a 
particular system primarily depends on the volume of data to be handled 
and the speed of the processor. 


Microcomputers A microcomputer is the smallest general-purpose 
Processing system. Functionally, it is similar to any other large system. 
Microcomputers are self-contained units and are usually designed for 
use by one person at a time. Since microcomputers can be easily linked 
to large computers, they form a very important segment of the integrated 
information systems, 


Minicomputers A minicomputer is a medium-sized computer that is 
more costly and powerful than a microcomputer. An important distinction 
between a microcomputer and a minicomputer is that the latter is usually 
designed to serve multiple users simultaneously. A system that supports 
multiple users is called multiterminal, time-sharing system. Mini- 
computers are the popular computing systems ато 15 research and 
business organisations today. 
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Mainframe computers Computers with large storage capacities and 
very high speed of processing (compared to micro or minicomputers) are 
known as mainframe computers. They support a large number of termi- 
nals for use by a variety of users simultaneously. They are also used as 
the central host computer in distributed data processing systems. 


Supercomputers Supercomputers have extremely large storage сарасі- 
ties and computing speeds that are many times faster than other com- 
puters. While the speed of traditional computers is measured in terms of 
millions of instructions per second (mips), a supercomputer is.rated in 
tens of millions of operations per second (mops) (an operntion is made up 
of numerous instructions). Typically, the supercomputer is used for large- 
scale numerical problems in scientific and engineering disciplines. These 
include applications in electronics, petroleum engineering, weather fore- 
casting, structural analysis, chemistry, medicine and physics, 


Personal computers Personal computers are nothing but micro- com- 
puters that are specially designed for personal use of individuals. The 
name “personal computer” was coined by IBM when it marketed its first 
microcomputer in 1981. Since then, many companies have produced IBM 
compatible PCs. During the last fifteen years, the processor chips used 
in IBM compatible PCs have undergone dramatic improvements in their 
performance characteristics. Table 2.3 shows the characteristics of vari- 
ous PC processor chips. Note that today’s PCs are far more powerful 
than the mainframes of just a few years ago. 


Table 2.3 Characteristics of microprocessor chips 


8088 286 Pentium Pentium 
PCXT PCAT 386 486 Pentium Pro п 


Clock speed 477 6-12 16-33 16-50 66-200 120-200 200+ 
(megahertz) 

Data path В 32 32 64 64 64 
(bits) 16 (SX) 

Computation 16 32 32 32 32 32 
size (bits) 


Memory-size 640K 2 megs 4-16 4-64 4-64 16-64 16-64 

(bytes) megs  megs  megs  megs megs 

Floating point Copro- Copro- Сорго On On On On 
cessor cessor cessor chip chip chip chip 
0.33 12 26-6 20-40 112 250 500 


29,000 130,000 275,000 1.2 33 5.2 10+ 
million million million million 


Workstations There is а class of computers, known as workstations, 
which lie in between minicomputers and microcomputers in terms of 
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processing power. A workstation looks like a personal computer but is 
specially designed for engineering and graphics applications. 

Paralle! computers | Parallel computer is a relatively new type of com- 
puter that uses a large number of processors. The processors perform 
different tasks independently and simultaneously, thus, improving the 
speed of execution of complex programs dramatically. Parallel computers 
match the speed of supercomputers at a fraction of the cost. 


COMPUTING CONCEPTS 


“A computer, small or big, is basically a device used for processing of data 
(numbers) and text (words). It performs essentially the following three 
operations in a sequence: 


1. receives data (and instructions) 
2. processes data (as per the instructions) 
3. outputs result (information) 


This cycle of operation of a computer is known as the input-process— 
vulput cycle and is shown in Fig. 2.1. 


Process. 


iom input Омри | Mr. Brown bought а] 
== chair from the markat 
for his office 
Ma Manat INFORMATION 
“DATA «ж ie 


Fig. 2.1 Input-process-output cyclo 


Raw facts, known as daía, are provided to the computer in bita and 
pieces. They are encoded in such a way that the computer can understand 
them. The computer then processes the data with the help of certain 
instructions provided to it, and produces a meaningful and desired out- 
put known as information. For example, if the data consists of two 
numbers, say, 10 and 15 and the instruction is to add them and print 
out the result, then the output information would be the sum of the two 
numbers, i.e. 25. A set of instructions designed to perform a particular 
sequence of functions is called a computer program. 

Processing is nothing but manipulation of data in accordance with 
certain procedures to suit the need of the user (or application). The same 
basic data can provide several kinds of information depending upon the 
type of instructions. 

Input is usually through a keyboard (like a typewriter) and output 
may be obtained either on a display screen or on a printer. While the 
printer produces typed copy on paper (usually known as hard copy), the 
screen display (soft copy) allows the user to verify the output before it is 
printed. 
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A computer often includes an external storage system to store (and 
retrieve) data and programme. The popular storage medium is a floppy 
disk. Other media, such as hard disks, magnetic tapos and CD ROMs 
are also used. All these physical components are known as hardware. 


" aR COMPUTER ORGANISATION 


Although computers differ widely in their details, all of them follow a 
basic organisational structure as shown in Fig. 2.2. In order to carry out 
the three basic operations, namely, input, process and output. a computer 
includes the following hardware components: input devices, processing 
units, output devices and external storage devices. 


Magnetic 
Tape 


К | EXTERNAL STORAGE UNITS 


Input “input Output 
Media Unit Media 
на 


Data and results flow 
Control instructions to units 
re Instructions to control unit 
Fig. 2.2 Structure of a computer 


Input Devices 

An input device presents data to the processing unit in machine-read- 
able form: Although the keyboard is a common input device for a small 
computer, a system may also support one or more of the input devices 
given in Table 2.4. 
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Table 24 input devices 


Device Medium of data storage Remarks 
Optical character Special paper document Only input 
reader (OCR) 

Magnetic ink character Special paper document ^ Only input 

recogniser (MICR) 

Mark sense reader Special paper or card Only input 

Graphics tablet Document Only input 

Mouse Document Only input 

Floppy drive Floppy disk Input, output, 
storage 

Hard disk Magnetic disk Input, output, 

(Winchester) drive storage 

Tape drive Magnetic tape Input, output, 
storage 

CD ROM drive CD ROM Input, storage 

focessing Units 


Processing units receive data and instructions, store them temporarily 
and then process the data as per the instructions. The Processing units 
include: memory unit, arithmetic logic unit, and control unit. All three 
units together are known as the central processing unit (CPU), 


Memory unit The memory unit holds (stores) all data, instructions and 
results temporarily. The memory consists of hundreds of thousands of 
cells called ‘storage locations’, each capable of storing one word of informa- 
tion, The memory unit is called by different names, such as storage, 
internal storage, primary storage, main memory or simply memory. 


Arithmetic logic unit This unit is used to perform all the arithmetic 
and logic operations, such as addition, multiplication, comparison, etc, 
For example, consider the addition of two numbers A and B. The control 
unit will select the number A from its location in the memory and load it 
into the arithmetic logic unit. Then it will select the number B and add 
it to A in the arithmetic unit. The result will then be stored in the 
memory or retained in the arithmetic unit for further calculations. 


Control uni! This unit coordinates the activities of all the other units in 
the system. Its main functions are: 

1. to control the transfer of data and information between various 

units 

2. to initiate appropriate actions by the arithmetic unit 

The program provides the basie control instructions. Conceptually, 
the control unit fetches instructions from the memory, decodes them, 
and directs various units to perform the specified tasks. 


Output Devices 


Jutput devices receive information from the CPU and present it to the 
"ser in the desired form. Although a printer is the most commonly used 
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output device, devices such as plotters are also becoming pepular. Some 
common output devices are given in Table 2.5. 


Table 2.5 Output devices 
Device Medium of presentation Remarks ` 


Printer Paper Only output. 
Plotter Paper Only output 
Visual display unit (VDU) Display screen Only output 
Floppy drive Floppy disk Input, output, storage, 
Disk drive Magnetic disk Input, output, storage 
"Tape drive Magnetic disk Input, output, 


External Storage Devices 


The purpose of external storage is to retain data and programs for future 
use. For example, а program may be required at regular intervals. If 
such information is stored in an external storage media, then one can 
retrieve it as and when necessary, thus avoiding the need to type it 
again. Any number of files containing information can be stored on 
external media. Since they are permanent (they are not erased when the 
equipment is turned off), one can store long files on external media, and 
later on work on them in sections, keeping all the sections in storage 
except the one currently in use.  - 

The popular external storage media used with micro and mini 
computers are floppy disks, hard disks and CD ROMs. 


Floppy disks The most common storage medium used on small com- 
puters today is a /loppy disk. It is a flexible plastic disk coated with 
magnetic material and looks like a phonograph record. Information can 
be recorded or read by inserting it into a disk drive connected to the 
computer. The disks are permanently encased in stiff paper jackets for 
protection and easy handling. An opening is provided in the jacket to 
facilitate reading and writing of information (Fig. 2.3). 


Label for identification 


(<— Write protect 
notch 


@<|—— Index hole 


Fig. 2.3 Floppy disk (6.25 inch) 
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Floppy disks are available in two standard sizes—5.25 inch and 3.5 
inch. The 3.5 inch floppy disk, which was introduced later, can store 
more information than the previous one. 


Hard disks Another magnetic media suitable for storing large volumes 
of infc tion i i 


pack is Permanently sealed inside a casing to protect it from dust and 
other contaminations, thus increasing its operational reliability and data 
integrity. 

Winchester disks possess a number of advantages compared to floppy 
disks: 

1. They can hold much larger volumes of information than floppies, 


capacities of 260, 540, 680, 1000, 1200, 2000 megabytes are typical on а 
personal computer, 


CD ROMs Compact. disk read-only memory (CD ROM) disks are used 


to distribute large volumes of data and text. Computer programs and 
user manuals are often distributed on CD ROMs, 


DRIVING THE compute THE SOFTWARE 


Computers need clear-cut instructions to tell them what to do, how to 
do, and when to do. A set of instructions to carry out these functions is 
called a program. A group of such programs that are put into а computer 


Software is an essential requirement of computer systems. Just as а 
car cannot run without fuel, a computer cannot work without software. 


in Fig. 2.4: operating system, utility programs, language processors and 
application programs. 

Software is intangible but resides on or is stored in something tangible, 
such as floppy disks and magnetic tapes, 


Operating System 
The software that manages the resources of a computer system and 
schedules its operation is called the operating system. The operating 
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Application packages 


Language processors/utiity 
programs 


| 
| | орнай fe 


—= 
Hardware 


E 


Fig. 2.4 Loyers of softwore 


system acts as an interface between the hardware and the user programs 
and facilitates the execution of the programs (Fig. 2.4), The principal 
functions of operating system include: 
1. to control and coordinate peripheral devices such аз printers, display 
screen and disk drives я 
to monitor the use of the machine's resources 
. to help the application programs execute its instructions 
„ to help the user develop programs 
. to deal with any faults that may occur in the computer and inform 
the operator 
The operating system is usually available with hardware 
manufacturers and is rarely developed in-house owing to its technical 
complexity. Small computers are built from a wide variety of micro- 
processor chips and use different operating systems. Hence, an operating 
system that runs on one computer may not run on the other. The popular 
operating systems include, among others, MS DOS and UNIX. 


gie ою 


Utility Programs 


There are many tasks common to a variety of applications. Examples of 
such tasks are: 

1. sorting a list in a desired sequence 

2. merging of two programs 

3. copying a program from one place to another 

4. report writing 
One need not write programs for these tasks. They are standard, and 
normally handled by utility programs. 

Like operating systems, utility programs are pre-written by the 
manufacturers and supplied with the hardware. They may also be 
obtained from standard software vendors. A good range of utility programs 
can make life much easier for the user. 


26 Numerical Methods 


Language Processors 

Computers can understand instructions only when they are written in 

their own. language called the machine language. Therefore, а program 

written in any other language should be translated into machine lan- 

guage. Special programs called language processors are available to do 
job. 


There are two forms of translators: compilers and interpreters. 

A compiler checks the entire user-written program (known as the 
Source program) and, if error-free, produces a complete program in ma- 
chine language (known as object program). The Source program is re- 
tained for possible modifications and corrections and the object program 
is loaded into the computer for execution. 

An interpreter does а similar job but in a different. style, The inter- 
preter (as the name implies) translates one statement at a time and, if 
error-free, executes the instruction, This continues till the last state- 
ment. Thus an interpreter translates and executes the first instruction 
before it goes to the second, while a compiler translates the entire pro- 


2. Interpreters take more time for the execution of a program compared 
to compilers because a statement has to be translated every time it 
is read, 

Compilers and interpreters are usually written and supplied by the 

hardware vendors. Since a compiler (or an interpreter) can translate 
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In some situations one may have to develop one's own programs to 
suit one's unique requirements. Once developed, they come into the 
category of unique application packages, 


С |] PROGRAMMING LANGUAGES 


The functioning of a computer is controlled by a set of instructions 
(called a computer program). These instructions are written to tell the 
computer: 

l. what operation to perform 

2. where to locate data 

3. how to present results 

4. when to make certain decisions 

The communication between two parties, whether they are machines 
or human beings, always needs a common language or terminology. The 
language used in the communication of computer instructions is known 
a8 the programming language. The computer has its own language and 
any communication with the computer must be in its language or 
translated into this language. 

Three levels of programming languages are available, They are: 

1. machine languages (low level languages) 

2. assembly (or symbolic) languages 

3. procedure-oriented languages (high level languages) 


Machine Language 


Computers are made of two-state electronic components which can 
understand only pulse and no-pulse (or ‘1’ and ‘0’) conditions. ‘Therefore, 
all instructions and data should be written using binary codes 1 and 0. 
The binary code is called the machine code or machine language. 

Computers do not understand English, Hindi or Tamil. They respond 
only to machine language. Added to this, computers are not identical in 
design. Therefore, each computer has its own machine language. 
(However, the script 1 and 0, is the same for all computers). This poses 
two problems for the user. 

First, it is difficult to understand and remember the various 
combinations of l's and 0's representing numerous data and instructions. 
Also, writing error-free instructions is a slow process, 

Secondly, since every machine has its own machine language, the 
user cannot communicate with other computers (if he does not know its 
language). Imagine a Tamilian making his first trip to Delhi. He would 
face enormous obstacles as the language barrier would prevent him from 
communicating. 


Assembly Language 


An assembly language uses mnemonic codes rather than numeric codes 
(as used in machine language). For example, ADD or A is used as a 
symbolic operation code to represent addition and SUB or S is used for 
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subtraction. Memory locations containing data are given names such as 
TOTAL, MARKS, TIME, MONTH, etc. 

As the computer understands only machine code instructions, a 
program written in assembly language must be translated into machine 
language before the program is executed. This translation is done by a 
computer program referred to as an assembler. 

The assembly language is again a machine-oriented language and 
hence, the program has to be different for different machines, The 
programmer should remember machine characteristics when he prepares 
a program. Writing a program in assembly language is still a slow and 
tedious task. 


Procedure-Oriented Language (POL) 


These languages consist of a set of words and symbols and one can write 
programs using these in conjunction with certain rules. These languages 
are oriented toward the problem to be solved or procedures for solution 
rather than mere computer instructions. These are more user-centered 
than the machine-centered languages. They are better known as high- 
level languages. 

The most important characteristic of a high-level language is that it is 
machine-independent and a program written in a high-level language 
can be run on computers of different makes with little or no modification 
The programmer need not know the characteristics of that machine. 
However, such programs need to be translated into equivalent machine 
code instructions before actual implementation. 

A program written in a high-level language is known as the source 
program and can be run on different machines using different translators, 
‘The translated program is called the object program. The major 
disadvantage of high-level languages is that they take extra time for 
conversion and thus, are less efficient compared to the machine-code 
languages. Figure 2.5 shows the system of implementing the three levels 
of languages. 


^ 
High-level 
language 


Results 


Fig.2.5 implementation of c program 


Introduction to Computers ond Computing Concepts 22 


Common High-level Languages 


Many high-level languages have been developed during the last three 
decades. The most common high-level languages are FORTRAN, BASIC, 
COBOL, C, PL/i, C++ and Java. Although, they are less efficient than 
the machine or assembly languages, they relieve the programmers of the 
tedious task of remembering numeric codes for storage locations, 
operations, etc. In addition, these languages are easier to learn and use. 

The choice of a language depends upon many factors such as the 
knowledge of the programmer, the computer, the problem to be solved, 
etc. The languages that are used more popularly are given in Table 2.6, 


Table 2.8 Summary of common high-level languages 


Year Language  Namederied Developed by Application 


from 
1957 FORTRAN FORmula IBM Science, 
‘TRANslation engineering 
1958 ALGOL ALGOrithmic International Science, 
Language group engineering 
1959 LISP LISt Processing МІТ, USA Artificial 
engineering. 
1960 APL A Programming IBM Science, 
Language engineering 


1961 COBOL COmmon Business Defence Dept., Business 
Oriented Language USA 


1964 BASIC Beginner's All Dartmouth Engineering, 
purpose Symbolic College, USA science, business, 
Instruction Code education 
1965 PL/I Programming IBM General 
Language 1 
1970 Pascal Blaise Pascal Federal Institute General 
of Technology, 
Switzerland 
1972 PROLOG PROgrammingin University of Artificial 
LOGic Marseille intelligence 
1973 С Earlier language Bell Laboratory General 
called B 
1975 Ada Augusta Ada Byron U.S. Defence Dept. General 
1983 C++ Language C Bell Laboratory General, 
object-oriented 
programming 
1991 Java None Sun Microsystems General, internet, 
object-oriented 


programming 
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Бажа INTERACTIVE COMPUTING 


A major breakthrough in programming took place in the early 1960s 
when interactive languages like BASIC were developed. With an 
interactive language, we can converse (interact) with a computer. Most 
of the modern languages including FORTRAN have incorporated 
interactive features. With the help of an interactive language, we may 
engage in a conversation with our computer like this: 


1 am computing sum of two values 
Please input value of x 

252.75 

Please input value of Y 

120.50 

Sum of X any Y is 376.25 

Do you want me to do опе more sum? 


No Thanks 


Bye then, See you again! 


The lower-case words are of the computer and the words underlined are 
ours. Such interactive computing would be useful in determining certain 
intermediate results and taking actions depending upon the values. 


2221 PROBLEM SOLVING AND ALGORITHMS 


Mathematical problems that can be solved through the computer may 
range in size and complexity. Since the computer does not possess any 
common sense and cannot make any unplanned decisions, the problem, 
whether it is simple or complex, has to be broken into a well-defined set 
of solution steps. It should be remembered that computers do not “solve” 
problems; rather, they are used to implement the solutions to problems, 
In every instance of problem solving, the computer cannot be used to 
solve the problem until a method of solution has been evolved and a 
detailed procedure has been prepared by the user. It is assumed that the 
user has a certain amount of background knowledge, knows certain facts 
about the problem and possesses sufficient deductive and reasoning skills. 
Problem solving involves the following steps: 
1. studying the problem in detail 
2. redefining or restating the problem 
3. identifying output requirements, input data available and conditions 
and constraints to be used 
- comparing alternative methods of solution 
. selecting the method which is considered to be the besi 
. preparing a logical and concise list of procedures or steps necessary 
for determining the solution 
7. computing the results 
8. examining the results for correctness 


oos 
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The computer's help may be necessary only in the seventh step. АЙ 
the remaining steps are Lo be performed by the user. It is this fact that a 
beginner finds difficult to appreciate. 

The logical and concise list of procedure for solving a problem is called 
an algorithm. Tt. describes the steps that lead to unambiguous results in 
a finite number of operations. Figure 2.6 illustrates an algorithm for 
finding the square root of a set of N numbers. 


Step 1: Find out the number of values tor which square roots are to be 
evaluated. 

Step 2: Take a value. 

Step 3: See whether the value is positive or negative. If positive, go to 
Stop 4, otherwise go to Step 6. 

Step 4: Evaluate Ihe square root. 

Step 5: Record the value and its square root. 

| Step 6: Repeat Steps 2 to 5 until all the values are completed. 


Fig. 2.6 Aigorilhm for finding the square root of a given set of values 


An algorithm prepared for the first time might need review to: 

1, determine the correctness of various steps 

2. reduce the number of steps, if necessary 

3. increase the speed of solving the problem 

An algorithm should also include steps to identify any abnormal data 
or results and take corrective measures, if possible. In case of large 
problems, we can break them into parts representing small tasks, prepare 
several algorithms and later combine them into one large algorithm. 
This is known as the modular approach. 

Developing computer programs using the modular approach is known 
as modular programming. A module is a program unit or entity that is 
responsible for a single task. Modules (known as subprograms) are 
arranged in a hierarchical structure (similar to an organisation chart) ав 
shown in Fig. 2.7. This is essentially top-down design in which bigger 
modules are broken into smaller ones such that they are small enough 
to be understood and easily coded using simple logic. 


Main program | 


Subprogram 1 [Гзта | | subprogram 3 
Г =] 
===] =] 


Fig. 27 Top-down modular design of a program 
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I3] FLOW CHARTING 


When organising à problem for computer solution, it is desirable to 
present the algorithm pictorially. A flow chart is a diagram that outlines 
the sequences of operations to be performed. The operating steps are 
placed in boxes that are connected by arrows to indicate the order of 
execution of steps. Figure 2.8 illustrates the flow chart for the algorithm 
shown in Fig. 2.6. It is perhaps the best available method for expressing 
what the computer must do. Some symbols commonly used in flow charts 
are shown in Fig. 2.9. 


Write the value and 
its square гоо! 


Fig. 2.8 Flow chart for finding the square root of a given set of numbers 


The important functions of a flow chart are as follows: 

1. It provides a graphic representation of the problem so that it is 
easier to understand the plan of solution. 

2. It provides & convenient aid to writing computer instructions 
(program). 
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3. It assists in reviewing and correcting the program. 

4. It helps in discussion of the solution logic with others. 

While drawing a flowchart, one must remember the following: 

1. First list the logical steps. 

2. Complete the main path of the logic first and then complete all 
branches and loops. 

3. Use descriptive terms or mathematical equations in the boxes. 

4. Each box should represent a step that is meaningful. 

5. Use unambiguous terms in the flow chart so that others can easily 
understand it, 


ect Start or end of the program 


С Computational steps 


аа / Input or outpul instructions 


Decision-making and branching 


С> кшш 


Q Connector or Joining of two parts of program 


Comcs pes Flow of control 
Fig. 2.9 Flow char! symbols 


[ STRUCTURING THE LOGIC 


Solution steps of all problems can be organised into one or combination 
of the following three control structures: 

1. sequence structure 

2. branching structure 

3. looping structure 

Sequence structure is used when the solution does not involve any 
repetitive operations or options. This is known as straight-line logic and 
is illustrated in Fig. 2.10. 

Branching refers to the process of. following one of two or more alternate 
paths of computations. This happens at а point where a test is performed 
to identify the conditions of certain variables in the process. The basis 
for selecting a particular path is stated within the decision box. The 
decision can be based on a comparison, on the value of a variable, on the 
sign of a variable, etc. The basic flow charts associated with branching 
are shawn in Fig. 2.11. 
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(а) (b) 
Fig. 2.11 Branching structure (IF THEN ELSE) 


In Fig. 2.11(a), a few steps are bypassed and the program is rejoined 
at a later stage. This is known as forward jump. In Fig. 2.11(b), each 
branch contains one or more computational steps. The two branches 
may join up again in the main path or may contain completely different 
steps and only join up at the end. 

Looping refers to the repeated use of one or more steps. There are two 
types of loops. One is the fixed loop where the operations are repeated a 
fixed number of times. In this case, the values of the variables inside the 
loop have no effect on the number of times that the looping operation is 
performed. The other is the variable loop where the operations are re- 
peated until a specified condition is met. Here, the number of times that 
the loop is repeated may vary. Searching for a particular item in a list of 
items is an example of variable loop. 
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Loops are also referred to as backward jumps. These jumps may occur 
either after meeting a specified condition in the process or after doing a 


certain computation. These jumps (loops) are illustrated in Fig. 2.12. 


Compute 


(a) Do and Test (DO UNTIL) 


(b) Test and Do (DO WHILE) 


O-——— 


Compute 


este — 


Yes 
(с) A mixed юор 
Fig. 2.12 ilustration of loops 
USING THE COMPUTER 


Computers can be used to solve specific problems that may be scientific 
or commercial in nature. In either case, there are some basic steps 
involved in using the computers. These are as follows: 
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1. 


Problem analysis Identify the known and unknown parameters 
and state the constraints under which the problem is to he solved. 
Select a method of solution. 


. Collecting information Collect data, information and the documents 


necessary for solving the problem and also plan the layout of output 
results. ^. 


. Preparing the computer logic Identify the sequence of operations 


to be performed in the process of solving the problem and plan the 
program logic, preferably using a program flow chart. 


‚ Wriling the computer program Write the program of instructions 


for the computer in a suilable language. 


. Testing the program There may be errors (bugs) in the program. 


Remove all these errors which may be either in using the language 
or in the logie. 


. Preparing the data Prepare input data in the required form. 
. Running the program This may be done either in batch mode or 


interactive mode. The computations are performed by the computer 
and the results are given out. 


‘The selection of a particular input/output. device depends upon the 
nature of the problem, type of input data and the form of output required 


ZR] SUMMARY 


We have discussed in this chapter the following aspects of computers 
and computing technology: 


evolution of computing devices 

generations of modern computers 

different types of computers 

input-process-output cycle of computing 

organisation and structure of a computer 

functions of various input, output and storage devices 

need for various types of computer programs 

importance of programming languages and their applications 
steps involved in solving mathematical problems 

use of flow charts for representing problem-solving algorithms 
application of modular and structured programming techniques for 
implementing computer-based solutions 


Key Terms 


Low leve! language 
Machine code 
Machine language 
Mainframe computer 
Marki 
Microcomputer 


(Contd.) 
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(Contd.) 
Binary code Microelectronics 
Branching structure Microprocessor 
Compiler Minicomputer 
Computer program Modelling 
Computer-aided design Modular programming 
Computer-aided iearning Object program 
Computer-aided manufacture Operation system 
Computer-managed learning Paralle! computer 
Data PCAT 
Debugging РСХТ 
Digital computer Pentium 
ENIAC Personal computer 
Expert systems Second generation 
Fifth generation Sequence structure 
First generation Simulation 
Floppy disk Slide rule 
Flow chart Software 
Fourth generation Source program 
Hard disk Straight-line logic 
Hardware Structured programming 
High-level language Supercomputer 
Hybrid computer Third generation 
IC chips Top-down design 
Information Transistor 
Interactive computing Translators 
Interpreter UNIVAC 
Knowledge-based systems Utility programs 
Language processors Vacuum tube 
Large-scale integration (LSI) Winchester disk 
Logarithm Workstation 


Looping structure 


4 


1. Describe the abilities of modern computers that are directly relevant. 


to numerical computing 


2. List at least two applications of computers in each of the following 


areas: 
(a) Industry 
(b) Business 
(с) Education 
(d) Engineering 


3. Match the items in the following lists: 


(a) China 
(b) John Napier 
(c) Blaise Pascal 


Punched Cards 


(ii) Accounting Machine 
(ш) Abacus 
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4. 
5. 


в. 


22. 
23. 
24. 
25. 


26. 


Numerical Methods 
(d) IBM (iv) Logarithm 
(e) Jacquard (v) Mark I 


Describe briefly the developments in computing technology during 
the three decades from 1945 to 1975. 

Describe the technology of fourth generation computers. How are 
they better than the earlier computer models? 

What are fifth generation computers? How are they different from 
fourth generation systems? 

Distinguish between analog and digital computers. 

Distinguish between special purpose and general purpose computers, 
A majority of computers used in the world today are digital. Why? 
What are personal computers? How are they different from 
microcomputers? 


‚ Describe the relevance of supercomputers to engineers and scientists, 


Define each of the following terms in one sentence: 
(a) Computer Program 
(b) Hardware 
(c) Information 
(4) Data 
(e) Software 
Describe the functions of the following units in a computer: 
(a) Memory Unit 
(b) Arithmetic Logic Unit 
(c) Storage Unit 


- Describe how an application program is implemented in a computer. 


Why do we need language processors? Deserihe the two forms of 
language processors available. 

Compare the functions of application programs with that of operating 
systems, 


‚ What is machine language? What are its limitations? 
. How is assembly language better that machine language? 


What are the features of high-level languages? 
How is a program written in a high-level language implemented on 
а computer? 


- State the contributions of the following organisations to the 


development of high-level languages: 

(a) IBM 

(b) Bell Laboratory 

(c) US Defence Department 
What are the advantages of interactive computing? 
State the main steps involved in solving a mathematical problem. 
What is an algorithm? How is it useful for a programmer? 
What is modular programming? How does it help in solving à 
problem? 
Why do we often use flow charts for developing computer pro- 
grams? 
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27. Describe the three basic control structures used in executing the 
solution steps. 
28. Critically compare the Do-and-Test and Test-and-Do looping struc- 
tures. 
29. Compare the following: 
(a) Forward jump versus backward jump 


30. Describe the basic tasks involved in solving a problem using a 
computer. 


Computer Codes and 
Arithmetic 


a ET Л 


1 INTRODUCTION 


Computers store and process numbers, letters and words that are often 
referred to as data. How do we communicate these numbers and words 
to computers? How do computers siore this data and process them? 
Since computers cannot understand the Arabic numeral or English 
alphabet, we should use some “codes” that can easily be understood by 
them. 

In all modern computers, storage and processing units are made of a 
set of silicon chips, each containing a large number of transistors, А 
transistor is a two-state device that can be put "off" and “on” by passing 
an electric current through it. Since the transistors are sensitive to 
currents and act like switches, we can communicate with the computers 
using electric signals, which are represented as а series of “pulse” and 
“no-pulse” conditions. For the sake of convenience and ease of use, а 
pulse is represented by the code “1” and a no-pulse by the code “0”. They 
are called bits, an abbreviation of “binary digits”. A series of 1's and 0's 
are used to represent a number or a character and thus, they provide a 
way for humans and computers to communicate with one another. This 
idea was suggested by John Von Neumann in 1946, The numbers 
represented by binary digits are known as binary numbers. Computers 
not only store numbers but also perform operations on them in binary 
form. Although information is stored in the computer memory in 
combinations of 0's and 15, binary numbers become cumbersome when 
expressing large numbers. For this reason, internai contents of a computer 
are not displayed in binary form. Instead, they are displayed as 
hexadecimal or octal systems. Number systems that are popularly used 
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in computing are the decimal system, binary system, hexadecimal system 
and octal system. 
Та this chapter, we will discuss the various number systems and their 


EZ DECIMAL SYSTEM 


The decimal number system, so familiar to us, is the oldest positional 
number system. In a positional system, a number is represented by a set 


position. The actual number of symbols used in a positional system 
depends on its base, 


Position — + 6 5 o4 4d 4 + „ 
Place Value — 4. 496 10 105 10 102 10! 10° 
For example, the decimal number 5704 represents: 


3 2 1 O -——— Position 
10" 0 шш jp —— — Value 
5 7 0 1 — —7— — Decimal point 
| |. L—— {1% a 4 
-———» 0x10! = 0 


Sum 5704 
We can express this in general form as 
m 
d, (10) +d, | (10"-1) 4 5+4 = Yd, 10: 
20 
where d, are the decimal symbols, 0 to 9 and m — 1 are the number of 
symbols. This is called the expanded. notation for the integer. 
Similarly, а fractional part of a decimal number can be represented as 
n 
Xd, 10% 
ii 


where » is the number of symbols in the fractional part, 
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EER BINARY SYSTEM 


The binary system is the positional number system to the base 2. Il uses 
two symbols 0 and 1. Again, each position in a binary number represents 
a power of the base as shown below. 
Position c— B 5 4 3 2 1 0 
іса манн == A Ж 4€ 3 35 ш Y 
(64; (32) (16) (8) (4) (2 a) 
Note that each successive position in the integer part of the binary 
number has a value two times greater than the position to its right. 
For example, the binary number 1101 represents the decimal values 
as shown below: 


3 2 1 0 =— Position 
? 9 2 2 -——— Value 
1 1 0 1 -—— Binary point 
| Lo. 1x1 я 1 
le yl = @ 
——————e€ 154 = 4 
== — 1х8 = 8 
Sum 13 — 


That is, 1101, = 13,6 
The subscript 2 denotes a number in binary system and 10 denotes a 
number in the decimal system. In general form, it can be written as 


d, (2") «d, , (777) + t do Ya? 
i-ü 


where d; are the binary symbols, 0 or 1. We can further generalise the 
notation to any base b ав уй, 6! 

ifm 
Note that the base b is usually an integer greater than one, and digits d, 
are between 0 and b — 1, The base is sometimes called radix and the 
fractional point is called radix point. 


Esa] HEXADECIMAL SYSTEM 


The hexadecimal system is a number system that uses 16 as its base. 
This system requires 16 one digit symbols. The first ten symbols are 
represented by digits 0 through 9 and the remaining six by the letters A. 
through F. The letter A denotes 10, B denotes 11 and so on. Table 3.1 
shows equivalents of decimal, binary, and hexadecimal values. 
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Table 3.1 Equivalent values of different systems 


Decimal Binary System Hexadecimal 
System Weight — 8 4 2 т System 
0 0 0 0 0 | Жаш 
1 0 0 0 1 1 
2 0 0 1 o 2 
3 0 о 1 1 3 
4 0 1 o 0 4 
5 0 1 0 1 5 
6 0 1 1 о 6 
т o 1 1 1 1 
8 1 0 0 0 8 
9 1 o 0 1 9 
10 1 LU 1 0 A 
п 1 0 1 1 B 
12 1 1 0° 0 с 
13 1 1 0 1 D 
14 1 1 1 0 E 
15 1 1 1 1 Е 
—————ы 


In the hexadecimal system, each Position represents a value 16 times 
greater than the position to its immediate right. The place values of 
hexadecimal system are shown below: 

Position ae” a 3 2 1 0 
Place Value —-» 161 16! 16? 16! 16° 

The following example illustrates the decimal value represented by a 

hexadecimal number, 


3 2 1 0 —+— Position 
16° 16* 16! 16° = +— Value 

1 2 А Е 7— Hexadecimal point 
| _ Le х1 = в 
—————— 10х16 = 160 
2x 256 512 
1 х 4096 = 4096 
Sum — 4783 


Thus, 12A, = 4783, 

To convert a binary number to hexadecimal, we need only to group 
the binary digits in sets of four and convert each group to its equivalent 
hexadecimal digit. Thus, the binary number 0111 1010 0001 0010 0001 
becomes 7A121 in hexadecimal. This is illustrated below: 


Binary quadruplets 0111 1010 0001 0010 0001 


+ Е 4 


Hexadecimal point 7 A i 8 d 
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This example clearly illustrates the advantage of hexadecimal system 
over binary system. For all large binary numbers, the hexadecimal rep- 
resentation is much more compact and, therefore, casier to write and 


manipulate than its binary equivalent. 


ERES OCTAL SYSTEM 


The octal number system is a system having base b as 8. The eight octal 
aymbols are 0 through 7. The place values in the octal system are pow- 
ers of 8 as shown below: 

Position sais Д 3 2 1 0 

Place Value ———- 8* g s в! 8° 
‘The position values increase by a factor of B from right to left. The 
example below shows an octal number and its equivalent decimal value: 

3 2 L 0 -— — Position 

ri ^ e ri -——— Value 

——— Octal point 


6х1 = 6 
——= 5x8 40 
0 x 64 0 


——————- 2x512 = 1094 
Sum — 1070 


" 


Thus, 2056, = 1070, 
Since 8 = 2°, each octal digit has a unique 3 bit binary representation. 
This is shown in Table 3.2. 


Table 3.2 Binary representation of octal digits 


Binary Representation 
000 
001 
010 


011 
100 
101 
110 
111 


Just as in the hexadecimal system, to convert a binary number {о octal, 
it is only necessary to group the binary digits in sets of three and con- 
vert each set to its octal equivalent. For example, the binary number 
1011010 can be represented in octal as follows: 
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Binary triplets 001 011 010 
t $ 3 
Octal equivalent 1 3 2 


. ӘХ CONVERSION OF NUMBERS 


We discuss here the following systems of conversion: 
1. non-decimal to decimal system 
2. decimal to non-decimal system 
3. octal to hexadecimal system 
4. hexadecimal to octal system 


Non-decimal System to Decimal System 


We can convert a number in base 2, base 8 or base 16 to a decimal 
number using the expanded notation discussed so far. This conversion 
can also be accomplished using thc following algorithms. 


Integral Part 

1. Multiply the leftmost digit by the base b 

2. Add the next digit to the right to the product 

3. Multiply the sum by the base b and add the next digit 

4. Continue the process until the last (rightmost) digit is added 
The sum is the decimal equivalent of the given integer number 


Fractional Part 

. Multiply the rightmost digit by 1/b 

. Add the next digit to the left to the product 

. Multiply the sum by 1% and add the next digit 

. Continue this process until the last (leftmost) digit in the fractional 
part is added 
5. Multiply the last sum by 1/b 

The product is the decimal equivalent of the given fractional number. 
Note that, in the integral part algorithm, the process ends when the 

rightmost digit is added, but, in the case of fractional part algorithm, the 

process ends when the leftmost digit is added and the final sum is 

multiplied by l/b. 


ж оюн 


the above algorithm as follows: 
Integral Paré Decimal Part 
1 1 9 4 2 ig 
| | | 
| | 


$ed 


| =i 
2 


| 
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Decimal Part 
iL s 1 


BEEN" 
ig 
IL 


— 


Conversion is done as follow. 
Integral Part 
а 2 а FT 
14— 
-x16. | | 
16 


"m — | 


| 
18 | 
x 16 | 

288 
m. A. 
298 | 

х 16 
4768 | 
+15 e 


—1183_ 
Thus, 12АЁ в = 4783; 


Decimal System to Non-decimal System 


It is easy to convert a decimal number to a number of any other system. 
To do this, we must consider the integer and fractional parts separately 
as we did earlier. Algorithms to accomplish this are given below: 
Integral Part 
1. Divide the integer part of the decimal number by the base b of the 
new system. The remainder will constitute the rightmost digit of 
the integer part of the new number. 
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2. Divide the quotient again by the base 6. The remainder is the 
second digit from right. 

3. Continue this process until a zero quotient is obtained. The last 
remainder is the leftmost digit of the new number. 

Fractional Part 

1. Multiply the fractional part of the decimal number by the base b of 
the new system, The integral part of the product constitutes the 
leftmost digit of the fractional part of the new number, 


the rightmost digit of the fractional part of the new number. 
Note that a duplicate fractional part indicates that the sequence will be 
an infinite one. The particular block of digits will be repeated over and 
over again, 


1 0 1 0 | 1 n 
Integral part of binary number 
The digits in the remainder form the binary number when they are 
dropped to the right, 


Fractional Part 


Fractional pari of binary number 
Multiplication Product Integral pat [0 [т 7] 1] 


0.375 x 2 0.75 0 
0.750 x 2 1.50 1 
0.500 x 2 1.00 i= 


The digits in the integral part form the binary number when they are 
read from the top down (or lifted up to the right as shown). 
Thus, 43.375; = 101011.011, 
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Hn 


Convert the decimal 63 to an octal number. 


fora 
EAEE 
Octal number 


Thus, 163,9 = 2435 


` ИШШШ ————— 


Multiplication 
0.65 x 2 

03x2 

06х2 

02x2 

04x2 

0.8 x 2 

0.6 х2 

0.2 x2 

04x2 

0.8 x 2 
Thus, 0.6549 = 1010011001 ... 
Note that a terminating decimal fraction need not have a terminating 
binary equivalent. This happens when a fractional part is repeated and 
therefore the process is terminated. 


Octal and Hexadecimal Conversion 


Using the binary system as an intermediate stage, we can easily convert 
octal numbers to hexadecimal numbers and vice-versa. The steps are as 
follows: 
Octal to Hexadecimal 

1. Write the octal number. 

2. Place the binary equivalent of each digit below the number. 

3. Regroup them as binary quadruplets from the binary point, with 

zeros added, if necessary. 

4. Convert each group into its equivalent hexadecimal digit. 
Hexadecimal to Octal 

1. Write the hexadecimal number. 

2. Place the binary equivalent of each digit below the number. 
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3. Regroup them as binary triplets from the binary point, with zeros 
added if necessary. 
4. Convert each group into its equivalent octal digit. 


Binary equivalent 010 100 011 -—-— —- Binary point 
Regrouped as 

binary quadruplets а 1010 0011 

Y 

Hexadecimal 0 А 3 
Thus, 243, = A316 


Hexadecimal 3 9 . B 8 
Binary equivalent 0011 1001  . 1011 1000 
Regrouped as 

binary triplets 000 


Hexadecimal 0 
Thus, 39.88, = 71.56, 


The grouping of binary digits into triplets or quadruplets plays an im- 
portant role in the internal organisation of information in computers. 
‘They are often used to represent long binary strings with lesser number 
of symbols. 


7 8793 REPRESENTATION OF NUMBERS 


As mentioned earlier, all modern computers are designed to use binary 
digits to represent numbers and other information. The memory is usu- 
ally organised into strings of bits called words. Each such string has the 
same length in a particular computer, although different computers may 
use different word lengths. For example, IBM PC and AT systems use а 
word length of 16 bits, while VAX 11 systems use a word length of 32 bits. 

The largest number a computer can store depends on its word length. 
For example, the largest binary number a 16 bit word can hold is 16 bits 
of 1, This binary number is equivalent to a decimal value of 65535. The 
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largest decimal number that can be stored in a computer is given by the 
following relation: 

Largest number = 2^ — 1 
where n is the word length in bits. Thus, we see that the greater the 
number of bits, the larger the number that may be stored. 

Although the computer works well with the binary numbers, humans 
do not. Firstly, it takes too many bits to represent a number. Secondly, 
writing such long series of bits can be exhausting and may cause errors. 
This is why we have other systems such as octal, hexadecimal and deci- 
mal systems, Computers read decimal numbers supplied by humans but 
convert them automatically into binary numbers for internal use, These 
binary numbers may also be expressed in the octal or hexadecimal form 
for print-out or display. For output, the numbers are reconverted to 
decimal form for human use, 


Integer Representation 


Decimal numbers are first converted into the binary equivalent and 
then represented in either integer or floating point form. Let us first 
consider the integer representation. 

For integers, the decimal or binary point is always fixed to the right 
of the least significant digit and therefore, fractions are not included. As 
mentioned earlier, the magnitude of the number is restricted to 2" — 1, 
where п is the word length in bits. 

How do we represent negative numbers? Negative numbers are stored 
by using the 2’s complement. This is achieved by taking the 1% comple- 
ment of the binary representation of the positive number and then add- 
ing 1 to it. 


10011 


Note that we have used an extra 0 to the left of the binary number 
representing 13. This is to indicate that the number is positive. 

Then, if the leftmost bit is 1, the number js negative. The leftmost (or 
the most significant) bit of a binary n'imber which is used to indicate the 
sign is called the sign bit. 

Now we see that if we reserve one bit to represent the sign of the 
number, we have only z — 1 bits to represent the number. Thus, a 16 bit 
word can contain numbers – 215 to 2!5 _ 1 (i.e, —32168 to 32767). 
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Show that the number -32768 is represented in a 16 bit word ав еа: 
1[0][0[0[0[0[0]0]0 


(-32767) + (- 
32167 0111 1113 1111 1111 
1's complement 1000 0000 0000 0000 


= +0000 0000 0000 0001 


-32767 _1000 0000 00000001 ^ ~—— (а) 
1 0000 0000 0001 0001 


1111 1111 1111 1110 


l'a complement 
" = + 0000 0000 0000 0001 


-1.2 _ 19111111 11111111  ~—— Ф 
-32768 1000 0000 0000 0000 -<——— (а) + (b) 
Floating Point Representation 


We have just seen how integer numbers are represented. We have also 
seen that a 16 bit computer cannot store a positive number larger than 
32767. What if we want to handle a fractional number like 35.7812 or a 
large number like 987654321? Such numbers are stored and processed. 
in what is known as exponential form. These numbers have an embed- 
ded decimal point and are called floating point numbers or real numbers. 
For example, 35.7812 can be expressed 0.357812 х 10°, Similarly, the 
number 987654321 can be expressed as 0.987654 x 10°, By writing a 
large number in exponential form, we lose some digits. If x is a real 
number, its floating point form representation is 
x=fx 10% 

The number f is called mantissa and E is the exponent. 

Floating: point numbers are stored differently. The entire memory 
location is divided into three fields or parts as shown in Fig. 3.1. The 
first part (1 bit) is reserved for the sign, the second part (7 bits) for the 
exponent of the number, and the third (24 bits) for the mantisaa of the 
number. Typically, floating numbers use a field width of 32 bits where 
24 bits are used for the mantissa and 7 bits for the exponent. 


31 30 24 23 0 


EXPONENT MANTISSA 


zo-o 


1 bit He— 7 bits —epe— 24 bhe —>{ 
Fig. 3.1 Floating point representation 


2-204696 
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Thus, we can represent very small fractions or very large numbers with- 
in the computer using the floating point representation. 


Convert the following numbers to floating point notation. 
0.00596, 65.7452, 486. 


0.00596 is expressed as 
65.7452 is expressed as 
-486.8 is expressed as - 0.4868 x 10* 


The shifting of the decimal point to the left of the most significant digit 
is called normalisation and the numbers represented in normalised form 
are known as normalised floating point numbers. You may note that the 
mantissa should satisfy the following conditions. 

For positive numbers: less than 1.0 but greater than or equal to 0.1 

For negative numbers: greater than —1.0 but less than or equal to —0.1 
That is, 0.1 < |/| <1 
The normalised floating point numbers are written using the following 
notation: 

0.596 x 10? written as 596 E -2 
-0.4868 x10? writtenas -– 4868 ЕЗ 


EA COMPUTER ARITHMETIC 


Different systems of computer arithmetic are currently available. They 
include integer arithmetic, fixed point arithmetic, floating point arith- 
metic, interval arithmetic and karlsruhe accurate arithmetic. 

They are either supported by hardware or software or some software/ 
hardware combinations. Each system uses its own scheme for represent- 
ing numbers in binary form within the machine. The most common and 
popular arithmetic systems are integer arithmetic and floating point 
arithmetic, Thesé systems are discussed briefly in this section. 


Integer Arithmetic 


Virtually all computers offer integer arithmetic. The main property of 
integer arithmetic is that the result of any arithmetic operation with 
integers is an integer. The other property is that the result is always 
exact with the following two exceptions; 
1. The range of integers that can be represented is not infinite but is 
bounded above and below. 
2. The result of an integer division is usually given as a quotient and 
a remainder (since fractions cannot be represented in the integer 
scheme) which is truncated. 


Computer Codes and Arithmetic S3 


mmu 0 ] 


Tllustrate integer arithmetic 
Addition: 2541 
Subtraction: 25-1 

12-25- 
Multiplication: 25 х 1 


Division: 25 +1. 
12 +25= 0 
Note that as only a finite range of integers can be represented, the 
product of two numbers may exceed the range. 


Show that the following rule does not generally hold good in integer 
arithmetic. 


Leta =5,6=7ande=3 


b 5+1 
Then, E шн 


The results are not identical, This is because the remainder of an inte- 
ger division is always truncated. 


Floating Point Arithmetic 


Although integer arithmetic is adequate in many computing applica- 
tions, it does not meet the requirements of many numerical computing 
methods as they often involve manipulation of fractional numbers. Hence, 
floating point arithmetic is the preferred choice for и majority of numer- 
ical computing applications. 

In the floating point system, all the numbers are stored and processed 
in normalised exponential form. The most difficult operations in the 
floating point arithmetic are addition and subtraction. 


Addition Let the two numbers to be added be x and y, and let z be the 
result. Let the fractional parts and exponents be f,, f, and f., and E,, Ey 
and E,, respectively. Then, the addition algorithm is as follows: 
1. Set E, = the larger of E, and E,. (Assume here E, >= E,. Then 
E, = E,). 
2. Shift Right. Shift f, to the right by E, — E, places. (This makes the 
exponent of f, and f, the same). 
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3. Add. Set f, - f, +f, 
4. Normalise, If the absolute value of f, is greater than one, shift the 
decimal point of f. to the left of the most significant digit and then 
increase E, by one. 
Then z= f, x 10% 


In all the manipulations, the result of any operation is normalised 
and the mantissa is rounded or truncated to p digits, where p is the 
precision of the computer used. In the examples discussed here, we 
assume a precision of 6 and the mantissa is truncated to 6 digits. 


NH. ____ 


Add the numbers 0.964572 E2 and 0.586351 E5. 


Let x = 0.586351 and у = 0.964572 E2 

E,-5 

Г, = 0.000964 

Ё = 0.000964 + 0.586351 = 0.587315 
Then, г = 0.587315 Е5 
Note that both the mantissa and exponent of the number with the small- 
er exponent are modified and the modified mantissa is truncated to six 
digits, 


f= 0.735816 + 0.635742 = 1.371558 
z = 1,371558 E4 = 0.137155 E5 
Note that the mantissa of the result is truncated, 


Subtraction Subtraction is nothing but addition of numbers with dif- 
ferent signs. However, the subtraction of mantissas may result in a 
number less than 0,1. In such cases, the decimal point should be shifted 
to the left of the most significant digit and the exponent of the result 
should then be decreased accordingly. 


999658 E-3 and y y= T09 994576 E. E-3 
E,=-3 
f, = 0.999658 — 0.994576 

= 0.005082 

x-y 

= 0.005082 E - 3 

= 0.508200 E - 5 (normalised) 


2 
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Multiplication Multiplication of two floating point numbers is relative- 
ly simple. 

1. Multiply the fractional parts: f, = f, f, 

2. Add the exponents: E, = E, + E, 

3. Then, z = f, x 10 

4. Normalise, if necessary. 


Multiply the numbers 0. 200000 EA: and 0.400000 E ~ 2 


.080000 E2 = 0.800000 E1 (normalised) = 0.800000 
Division Division is done as follows: 
1, Divide the fractional parts: f. = /./f, 
2. Subtract the exponente: E, - E, Ё, 
3. z = f, x 10% 
4. Normalise, if necessary. 
Divide the number 0.876543 Е — 5 by 0.200000 Е - 3 


.876543 + 0.200000 
.382715 
5 — (-3) = -2 
z = 4.382715 E- 2 
= 0.438271 E - 1 (normalised) 
Note that the mantissa of the result is truncated, 


EA ERRORS IN ARITHMETIC 


In integer arithmetic, while all arithmetic operations are exact, we might 
come across the following two situations: 
1. An operation may result in a large number that is beyond the 
range of the numbers that the computer can handle. 
2, An integer division may result in truncation of the remainder. 
When the result is larger than the maximum limit, it is referred to as 
an overflow and when it is less than the lower limit, it is referred to ав 
underflow. Unfortunately, most computers do not issue any warnings or 
messages on integer overflow or underflow. Therefore, we should use 
integer arithmetic with utmost care. 
The floating point arithmetic system is prone to the following errors: 
1. Error due to inexact representation of a decimal number in binary 
form. For example, consider the decimal number 0.1. The binary 
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equivalent of this number is 0.0001100110011.... The binary equiv- 
alent has a repeating fraction and therefore must be terminated at 
some point. 

2. Error due to rounding method used by the computer, in order to 
limit the number of significant digits. This was illustrated in the 
examples discussed in Section 3.8. In fact, if the numbers added 
are too different in magnitude, the smaller may be treated as if it 
were zero (see Example 3.18). 

3. Floating point subtraction may induce a special phenomenon, It is 
possible that some mantissa positions in the result are unspecified. 
This happens when two nearly equal numbers are subtracted. This 
is known as subtractive cancellation. If the operands themselves 
represent approximate values, the loss of significance is serious 
since it greatly reduces the number of significant digits. The error 
can be arbitrarily large (see Example 3.21). 

4. Overflow or underflow can occur in floating point operations when 
the result is outside the limits of floating point number system of 
the computer. 

The following examples illustrate these errors 


Add the numbers 0.500000 E1 and 0.100000 E - 7. 


Let 
x = 0.500000 E1 and y = 0.100000 E ~ 7 
Е,=1 
fy = 0.000000001 
J, = 0.500000001 = 0.500000 


2 = 0.500000 E1 
Note that the value of їз the same аз that of x. 


f. = 0.175000 
2 = 0.175000 E110 


If we assume that the exponent can have a maximum value of 99, then 
the result overflows. 


Divide the number 0.875000 E — 18 by 0.200000 E95. 


E, = -18 95 - -113 
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fa = 0.875000 + 0.200000 = 4.375000 
2 = 4.375000 E — 113 
= 0.437500 E - 114 


If we assume that the exponent can have a minimum value of - 99, then 
the result underflows. 


Thus, z = 0.000002 x MI = 0.200000 х 10° 


The result contains only one significant digit. If the values of x and y are 
not exact, then the result may not reflect the true difference between 
them. In many systems, these unspecified digits are filled by arbitrary 
digits thus causing a further increase in the error. 


B LAWS OF ARITHMETIC 


Due to errors introduced in floating point arithmetic, the associative and 
distributive laws of arithmetic are not always satisfied. That is, 

х + (у +2) а(х жу) +2 

хх (ух) а(х ху) х2 

хх (y + 2) = (£ x y) +(x x2) 
Although failure of these laws to be satisfied affects relatively few com- 
putations, it can be very critical on some occasions. The examples that 
follow illustrate the discrepancies. 


cel . 


Associative law for addition 


0.004567 + 0.243451 = 0.248018 
0.248018 — 0.248000 = 0.000018 
0.180000 х 107+ 
0.243451 — 0.248000 = — 0.004549 = -0.4549 x 10? 
(0.456732 - 0.454900) 107° 
= 0.183200 х 107° 
Thu, (x+y) +2#х+(у+2) 
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e xyxz- (0.200000 x 10*110) (0.300000 х 10:9?) 
Note that (x x y) causes overflow and so the result will be erroneous. 
x x (y x 2) = (0.400000 х 10*5) x (0.1500000 x 1019) 
= 0.060000 х 10% = 0.600000 х 1079 


This gives the correct result assuming that the exponent can take a 
value up to +99. 


Distributive 1 law 


Lei x = 0400000 x 10!, у = 0.200001 х 10°, г = 0.200000 х 10° 
x x (y – 2) = (0.400000 х 10") х (0,100000 x 10-5) 
= 0.400000 x 107° 


(x x y) - (x x z) = 0.800000 x 10° - 0.800000 х 10^ = 0 


mes SUMMARY 


In this chapter, we have discussed a very important aspect of numerical 
computing, namely, the internal representation of numbers in a comput- 
er. We considered the following in detail: 

» number systems that are popularly used in computing 

e conversion of numbers from one system to another 

e storage of numbers in the memory of a computer 

different systems of arithmetic operations that are commonly used 
in numerical computing 
errors introduced by arithmetic operations 
e associative and distributive laws of arithmetic 


Key Terms 


Associativa law Interval arithmetic 
Base Mantissa 

Binary digits Normalisation 
Binary numbers Nommalised numbers 
Bits Octal numbers 
Computer memory Overflow. 

Data Processing 

Decimal numbers Quadruplets 
Distributive law Radix point 


(Сота) 
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Real numbers 
Fixed point arithmetic _ Sign bit 
Floating point form Storage 
Floating point arithmetic Subtractive cancellation 
Hexadecimal numbers Transistor 
Integer arithmetic Triplets 


Integer form Undertiow 


1. How many binary digits are there? Which symbols are used for 
them? What are they usually called? 

2. Binary digits are used to store and manipulate data in computers. 
Why? Why do then we use other number systems? 

3. What is the complement of a number? Obtain the complement of 
the decimal number 5749? 

4. How do we obtain one's complement and two's complement of a 
binary number? = 

5. What are the uses of complements of binary numbers? 

6. What is sign bit? How does the computer store a negative number? 

7. An 8-bit register stores numbers in two's complement form. 

(a) What is the largest positive decimal number that can be 
Stored? 

(b) What is the smallest negative decimal number that can be 
stored? 

8. Why do we need to represent numbers in exponential form? Ex- 
plain how a decimal number is represented inside the computer 
using the exponential form? 

9. Explain the following: 

(a) Overflow 
(b) Underflow 
10. Discuss the errors that may occur during the floating point arith- 
metic operations. 
11. The hexadecimal equivalent of the binary number 10011101 is 


(a) 5A (b) FF (c) 9D (d) 9E 
12. The decimal equivalent of the binary number 10011101 is 
(a) 27 (b) 157 (с) 13 (d) 144 


13. The binary equivalent of the octal number 42 is 

(а) 101110 (b) 111010 (с) 100010 (а) 101011 
14. The octal equivalent of the hexadecimal number CD5 is 

(a) 3625 (b) 6325 (с) 3652 (d) 6352 
15. The hexadecimal equivalent of the decimal number 163 is 

(a) B3 (b) A2 (c) A3 (d) 93 
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. Convert the decimal numbers (i) 29, (ii) 123, and (iii) 432 to 


(a) binary system 
(b) octal system. 
(c) hexadecimal system 


. Convert the octal numbers (i) 25, (ii) 52, and (iii) 563 to 


(а) decimal system 
(b) binary system 
(c) hexadecimal system 


. Convert the hexadecimal numbers (i) ВЕ, (ii) BC4, and (iii) AF3D to 


(a) decimal system 
(b) octal system 
(c) binary system 


. Convert the binary numbers (i) 0101, (ii) 0111.0111, and (iii) 1011.11 


to 
(a) decimnl system 
(b) octal system 
(c) hexadecimal system 


. Assuming that the computer stores each number in a 16-bit memo- 


ry location, find the internal. representations of the following num- 
bers: 


(a) 498 
(b) -498 
. Write the following numbers in normalised exponential form and 
E-form. 
(a) 12.34 (d) -0.009876 
(b) -654.321 (e) 0.0 
(c) 0.001234 (f) 12345 


. Assuming that the mantissas are truncated to 4 decimal digits, 


show how the computer performs the following floating point oper- 
ations: 

(a) 0.5678 x 10* + 0.6666 » 10* 

(b) 0.1234 x 10* + 0.4455 x 1072 

(c) 0.3366 х 1072 —0.2244 х 107! 

(d) 0.6789 х 10 x 0.2233 x 101 

(e) 0.6789 x 10° + 0.2233 x 10* 


. Assuming that the mantissas are truncated to 4 decimal digits, 


compute the error in the following computations: 
(a) 5.6789 — 1.2345 
(b) 5.6789 4 9.2345 


. Tilustrate with examples the concept of overflow and underflow. 
. Discuss an example to show that the distributive law of arithmetic 


is not always satisfied in numerical computing 


Approximations апа 
Errors in Computing 


[И INTRODUCTION 


Approximations and errors are an integral part of human life. They are 
everywhero-and unavoidable. This is more son the life of a. computational 
scientist. 

We cannot use numerical methods and ignore the existence of errors. 
Errors come in a variety of forms and sizes; some are avoidable, some 
are not. For example, data conversion and roundoff errors cannot be 
avoided, but a human error can be eliminated. Although certain errors 
cannot be eliminated completely, we must at least know the bounds of 
these errors to make use of our final solution. It is therefore essential to 
know how errors arise, how they grow during the numerical process, 
and how they affect the accuracy of a solution. 

By careful analysis and proper design and implementation of 
algorithms, we can restrict their effect quite significantly. 

As mentioned earlier, a number of different types of errors arise during 
the process of numerical computing. All these errors contribute to the 
total error in the final result. A taxonomy of errors encountered in a 
numerical process is given in Fig. 4.1 which shows that every stage of 
the numerical computing cycle contributes to the total error. 

Although perfection is what we strive for, it is rarely achieved in 
practice due to a variety of factors. But that must not deter our attempts 
to achieve near perfection. Again the question is: How much near? 

In this chapter we discuss the various forms of approximations and 
errors, their sources, how they propagate during the numerical process, 
and how they affect the result as well as the solution process. 
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machine 


Fig. 4.1 Taxonomy of errors 


E SIGNIFICANT DIGITS 


know that all computers operate with a fixed length of numbers. In 
particular, we have seen that the floating point representation requires 
the mantissa to be of a specified number of digits. Some numbers cannot. 
be represented exactly in a given number of decimal digits. For example, 
the quantity лів equal to 
r 3.1416526535897932384626.. 


Such numbers can never be represented accurately. We may write it 
as 3.14, 3.14159, or 3.141592653. In all cases we have omitted some 
digits. 

Note that transcendental and irrational numbers do not have а 
terminating representation. Some ralional numbers also have а 
repeating decimal pattern. For instance, the rational number 2/7 = 
0.285714285714.... Suppose we write 2/7 as 0.285714 and ~ as 3.14159. 
Then we say the numbers contain six significant digits. М 

The concept of significant digits has been introduced primarily to 
indicate the accuracy of a numerical value. For example, if, in the number 
у = 23.40657, only the digits 23406 are correct, then we may say that у 
has five significant digits and is correct to only three decimal places. 

Tn general, when a number is said to be "good to four digits", it means 
that the number has four significant digits. The omission of certain 
digits from a number results in what is called roundoff error. ‘The following 
statements describe the notion of significant digits. aa ИШЕ, 
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~ AC All non-zero digits are significant. 
(AN zeros occurring between non-zero digits are significant digits. 
У Trailing zeros following a decimal point are significant. For example, 
50, 65.0 and 0.230 have three significant digits each. 
re between the decimal point and preceding a non-zero digit are 
not significant. For example, the following numbers have four 
significant digits. 07 
0.0001234 (1234 х 1077) 
0.001934 (1234 х 10%) 
0.01934 (1234 х 1055) 
„ When the decimal point is not written, trailing zeros аге not 
considered to be significant. For example, 4500 may be written as 
45 « 10? and contains only two significant digits. However, 4500.0 
contains four significant digits. Further examples are: 
7.56 х 10* has three significant digits. 
7.560 x 10* has four significant digits. 
7.5600 « 10* has five significant digits. 
Integer numbers with trailing zeros may be written in scientific notation 
to specify the significant digits. 
The concept of accuracy and precision are closely related to significant. 


digits. They are related as follows: М 
A. Accuracy refers to the number of significant digits in a value. For 


example, the-number 57.396 is accurate to five significant digits 


7-9. Precision refers to the number of decimal positions, i.c. the order of 
last digit in a value. ‘The number 57.396 has a 


азар 
has a precision of 107 
has a precision of 1076 


át is the accuracy of the following numbers? 
(a) 95.763 (b) 0.008472 (с) 0.0456000 (9) 36 fe) 3600 (f) 3600.00 


(a) This has five sd. 

(b) This has four sd. The leading or higher order zeros are only place 
holders. 

(с) This has six sd. 
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) This-has two sd. 
a Accuracy is not specified. 
(D This has =e Note that the zeros were made significant by writing 
.00 after . 


INHERENT ERRORS 


rent errors are those that are present in the data supplied to the 
model. Inherent errors (also known as input errors) contain two 
components, namely, data errors and conversion errors. 


bade 

(а error (also known as empirical error) arises when data for a problem 
are obtained by some experimental means and are, therefore, of limited 
accuracy and precision. This may be due to some limitations in 
instrumentation and reading, and therefore may be unavoidablg A 
physical measurement, such as a distance, a voltage, or a time period, 
cannot be exact. It is, therefore, important to remember that there is no 
use in performing arithmetic operations to, say, four decimal places 
when the original data themselves aro only correct to two decimal places. 
Fof instance, the scale reading in a weighing machine may be accurate 

unb ane decimal place. 


- Conversion Errors 


“Conversion errors (also known as representation errors) arise due to the 
Timitations of the computer to store the data exactly, We know that the 
floating point representation retains only a specified number of digits. 
The digits that are not retained constitute the roundoff errog). 

As we have already seen, many numbers cannot be represented exactly 
in a given number of decimal digits. In some cases a decimal number 
cannot be represented exactly in binary form. For example, the decimal 
number 0,1 has a non-terminating binary form like 
0.00011001100110011.... but the computer retains only a specified number 
of bits. Thus, if we add 10 such numbers in a computer, the result will 

not be exactly 1.0 because of roundoff error during the conversion of 0.1 

to binary firm, 


epresent the decimal numbers 0.1 and 0.4 in binary form with an 
accuracy of 8 binary digits. Add them and then convert the result back 
to the decimal form 


0.0001 1001 
0.0110 0110 
Sam = 0.0111 1111 
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= 0.25 + 0.125 + 0.0625 + 0.03125 + 0.015625 
+ 0.0078125 + 0.00390625 
= 0.49609375 
Note that the answer should be 0.5, but it is not. This is due to the error 
in conversion from decimal to binary form: Remember, both the numbers 
have non-terminating binary representation. 


Error is equal to 2* = 0.00390625. It is clear that the error can be 
reduced by increasing inary digits that represent the number, For 
example, if we use its, then the error will be equal to 2-15 = 0.15258789 
x 107* A 


NUMERICAL ERRORS 


‘merical errors (also known as procedural errors) are introduced during. 
the process of implementation of a numerical method. They corae in-two 
forms, roundoff errors and fruncation errors) The total-numerical error 
is the summation of these two errors. "The total error can be reduced by 
devising suitable techniques for implementing the solution. We shall see 
“in this section the magnitude of these errors. 


Я doff Errors - - 


'undoff errors occur when a fixed number of digits are used to represent 
exact numbers) Since the numbers. are stored at every stage of 
computation, roûndof error is introduced at the end of every arithmetic 
operation \Consequently, eventhough an individual roundoff error could 
be very sthall, the cumulative effect of a series of computations can be 
very significant. 

Rounding а number can be done in two ways. One is known as chopping 
and raga known as symmetric rounding. Some systems use the 


chopping «tiethod while others use symmetric rounding. 


pping 
n chopping, the extra digits are dropped. This is called truncating the 
number. Suppose. we are using a computer with a fixed word length of 
four ne a Aap 894 si be Bi Mev ree ona а 
digits/93 wil ro . We ean ess the num А in floatinj 
palit faris as ail Ex =" 
x = 0.427893 » 10* 

= (0.4278 + 0.000093) х 10* 

= 10.4278 + (0.93 x 10)) х 10? 
This can be expressed in general form as 

True x = (f, + g, x 10 4)108 
7f x 108 +g, х 1908-8 
= approximate x + error. 
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where f, is the mantissa, d is the length of the mantissa permitted and E 
is the exponent. In chopping, g, is ignored entirely and therefore, 
Error =g, x 10 -4,0sg,<1 

The absolute error introduced depends on the following: 

1. the size of the digits dropped 

2. number of digits in mantissa 

3, the size of the number 
Since the maximum value of g, is less than 1.0, 


Symmetric Roundoff 
Tn the symmetric roundoff method, the last retained significant digit is 
"rounded up" by 1 if the first discarded digit is larger or equal to 5; 
otherwise, the last retained digit is unchanged. For example, the number 
42.1893 would become 42.79 and the number 76.5432 would become 
76.54, 
As before, the value of unrounded number can be expressed as 
True x = f х 108 + g, x 10577 
When д, < 0.5, entire д, is truncated and therefore, 


Approximate x f, x 10* 


and 
Emor-g,x10577, g, < 0.5 
When g, 2 0.5, the last digit in the mantissa is increased by 1 and 
therefore 
Approximate x = (f, + 10%) x 10 = f, х 105 + 105^ 4 
Error = [f, x 107 + g, х 10577] - (f, x 10 + 105-4] 


(g,-0x10-4) 6,205 
In either case, 105 - ? is multiplied by factor whose absolute value is no 


greater than 0.5. Therefore, the value of the absolute error is 
Absolute error < 0.5 х 105-4 
Note that the symmetric rounding error is, at worst, one-half the chopping 
error. 
Sometimes a slightly more refined rule is used when the g, is exactly 
equal to 0.5. Here f, is unchanged if its last digit is even and is increased 
by 1 if its last digit is odd. 


d-the roundoff error in storing the number 752.6835 using a four 
igit mantissa. 


== “True x = 0.1526 x 10° + 0.835 x 107 
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Chopping method 
Approximate x = 0.7526 x 10° 
Error = 0.0835 
Symmetric rounding "3 
Errer=tg,— D х 107 
~ = -0.165 x 10"! = — 0.0165 
imate х = 0.7527 x 10° 


‘runcation errors arise from using an approximation in place of an exact 
mathematical procedure) Typically, it is the error resulting from the 
truncation of the numerfcal process. We often use some finite number of 
terms to estimate the sum of an infinite series. For example, 


S «Sa; xí. is replaced by the finite sum Ya, x! 
ia ind 


The series has been truncated. 

Another example is the use of a number of discrete steps in the solution 
of a differential equation. The error introduced by such discrete 
approximations is also called discretisation error. Consider the following 
infinite series: 


х5 
1 Б! 

When we calculate the sine of an angle using this series, we cannot 
use all the terms in the serics for computation. We usually terminate 
the process after a certain term is calculated. The terms “truncated” 
introduce an error which is called ¢runcation error. 

Many of the iterative procedures used in numerical computing are 
infinite and, therefore, a knowledge of this error is important. Trunca- 
tion error can be reduced by using a better numerical model which 
usually increases the number of arithmetic operations. For example, in 
numerical integration, the truncation error can be reduced by increasing 
the number of points at which the function is integrated. But care should 
be exercised to see that the roundoff error which is bound to increase 
due to increase in arithmetic operations does not off-set the reduction in 
truncation error. 

We often use library functions to compute logarithms, exponentials, 
trigonometric functions, hyperbolic functions, and so on. In all these 
cases, a series is used to evaluate these functions. It is important to 
know the truncation errors introduced by these library functions. 
‘Truncation errors are discussed in detail in many places in this book. 
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" Ru 
í 
Pira truncation error in the result of the following function for 


x = 1/5 when we use (a) gs ihe terms, (b) first four terms, and 
(c) first five terms. 


х oF ot x4 хр gt 
le A(R he 


(a) Truncation error when first three terms are added 


Truncation error = + E = me 2 * a 
0.23 0.24 (0.25 9.26 
ТЗ 17947 7120 * 720 
= 0.1402755 10? 7 
(b) Truncation error when first four terms are added 
‘Truncation error.= 0.694222 х 10 * 
(c) Truncation errér when first five terms are added 
‘Truncation error = 0.275555 х 105 


газ 19,94 0:22 02* 0.24 0,25 0.28 
id 023573 8 " 24 120 5 720 


(a) Truncation error (three terms) = — 0.1279255 x 10? 
(b) Truncation error (four terms) = + 0.6665556 х 10+ 


(c) Truncation error (five terms) = – 0.257777 x 1055 
Note that 


| E41 e» 
ITE] um 


5 
ITE.5| Ser 
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formulated to represent physical processes using certain parameters 
involved in the situations. In many situations, it is impractical or 
impossible to include all of the real problem and, therefore, certain 
simplifying assumptions are made. For example, while developing a model 
for calculating the force acting on a falling body, we may not be able to 
estimate the air resistance coefficient (drag coefficient) properly or 
determine the direction and magnitude of wind force acting on the body, 
and so on. To simplify the model, we may assume that the force due to 
air resistance is linearly proportional to the velocity of the falling body 
or we may assume that there is no wind force acting on the body. All 
such simplifications certainly result in errors in the output from such 
models. 

Since a model is a basic input to the numerical process, no numerical 
method will provide adequate results if the model is erroneously conceived 
and formulated. It is obvious that we can reduce these type of errors by 
refining or enlarging the models by incorporating more features. But the 
enhancement may make the model more difficult to solve or may take 
more time to implement the solution process. It is also not always true 
that an enhanced model will provide better results, We must note that 
modelling, data quality and computation go hand in hand. An overly 
refined model with inaccurate data ог an inadequate computer may not 
be meaningful. On the other hand, an oversimplified model may produce 
a result that is unacceptable: It is, therefore, necessary to strike’a balance 
between the level of accuracy and the complexity of the model. A model 
must incorporate ойу those features that are essential to reduce the 
error to an acceptable level 


BLUNDERS _ 

funders are errors that are caused due to human imperfection. As the 
name indicates, such errors may cause a very serious disaster in the 
result. Since these errors are due to human mistakes, it should be possible 
to avoid them to a large extent by acquiring a sound knowledge of all 
aspects of the problem as well as the numerical process. 

Human errors can occur at any stage of the numerical processing 
cycle, Some common types of errors are: 

1. lack of understanding of the problem 

2. wrong assumptions 

3. overlooking of some basic assumptions required for formulating the 

~ model 
4, errors in deriving the mathematical equation or using a model that 

does not describe adequately the physical system under study 
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5. selecting a wrong numerical method for solving the mathematical 
model 
6. selecting a wrong algorithm for implementing the numerical method 
7. making mistakes in the computer program, such as testing a real 
number for zero and using < symbol in Place of > symbol 
8. mistakes in data input, such as misprints, giving values column- 
wise instead of row-wise to a matrix, forgetting a negative sign, ete. 
9. wrong guessing of initial values 
As mentioned earlier, all these mistakes can be avoided through a 
reasonable understanding of the problem and the numerical solution 
methods, and use of gail programming techniques and tools. 


[ЖЛ] ABSOLUTE AND RELATIVE ERRORS A“ 


\ Let us-üow consider some fundamental definitions of error analysis, 
1 rdless of its source, an error is usually quantified in two different 
but related ways. One is Known as absolute error and the other Ts called 
реа с=т= отыш ыны 

pose that the true value of a data item is denoted by x, and 
its approximate value is denoted by x,- Then, they are related as follows: 
True value x, = Approximate value Xa + Error. 
The error is then given by 
Error = x, - x, 
The error may be negative or positive depending on the values of x, and 
Xa In error analysis, what is important is the magnitude of the error 


and not the sign and, therefore, we normally consider what is known as 
absolute error which is denoted by 


—— A т, 
In many cases, phos — its influence correctly as it 


does not take into account the order of magnitude of the value under 
study. For example, an error of 1 gram is much more significant in the 
weight of a 10 gram gold chain than in the weight of a bag of rice. In 
view of this, we introduce the concept of relative error which is nothing 
but the “normalised” absolute error. The relative error is defined as 
follows: 


_ absolute error, 


More often, the quantity own to us is x, and, therefore, we can 
modify the above relation as follows: 
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The fractional form of e, can also be expressed as the per cent relative 
error as 


Pef cent e, = e, x 100 
P as ina 


A civil engineer has measured the height of a 10 floor building as 2950 
cm and the working height of each beam as 35 ет while the true values 
are 2945 cm and 30 cm, respectively. Compare their absolute and rela- 
tive errors, 
Absolute error in measuring the height of the building is 
e, = 2950 – 2945 = 5 em 
‘The relative error іѕ ~ 9 
“5° ene 5/2945 = 0.0017 = 0.17% 
Absolute error d measuring the height of the beam is 
ез = 35 – 30 = 5 cm 


The relative error is m 
е, a£ /30 = 0.17 = 17% 


Although the absolute prot ат are the same, the relative errors differ by 
100 times. It shows that there is something wrong in the measurement 
of the height ofthe beam. It should be done more accurately. 


MACHINE EPSILON 


Recall that the round off error introduced in a number when it is repre- 
sented in floating point form is given by 


Chopping error = g x 10*- 4, Osg<l 


where g represents the truncated part of the number in normalised 
form, d is the number of digits permitted in the mantissa, and F is the 
exponent. The absolute relative error due to chopping is then given by 


&x10£-4 


The relative error is maximum when g is maximum and f із minimum. 
We know that the maximum possible valuc of g is less than 1.0 and 
minimum possible value of f is 0.1. The absolute value of the relative 
error therefore satisfies. 
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210411 


Р 10х10Ё-4 
” | 0.1% 108 


The maximum relative error given above is known as machine epsilon. 
The name “machine” indicates that this value is machine dependent. 
This is true because the length of mantissa d is machine dependent. For 
a decimal machine that uses chopping, 


Machine epsilon s= 107 * ! 
Similarly, for a machine which uses symmetric roundoff, 


0.5% 108-2] | 
i 107471 
6101510 | 2" 


Machine epsilon £ de 10-4*! | 


and therefore 


It is important to note that the machine epsilon represents upper bound 
for the roundoff error due to floating point representation. It also sug- 
gests that data can bo represented in the machine with d significant 
decimal digits and the relative error does not depend in any way on the 
size of the number. 

More generally, for a number x represented in a computer, 


[_ Absolute error bound = |x| хк 


For a computer system with binary representation, the machine epsi- 
lon is given by 


Chopping 
Machine epsilon = = 

Symmetric rounding 
Machine epsilon г = 274 

Note that we have simply replaced the base 10 by base 2. Here d 
indicates the length of binary mantissa in bits. 

We may generalise the expression for machine epsilon for a machine 
which uses base b with d-digit mantissa as follows: 
` а= х 79 for chopping 
£= b/2 x b for symrhetric rounding 


9-4+1 


а computer uses a number base 2, how many significant decimal 
figits are contained in the mantissa of flaating numbers? 


Assume that the binary computer has it mantissa. Then the error 
bound is 2”. This computer will have q mni icant digits with symmetric 
rounding, if, 
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2? = 1/2 х 107"! 
Taking logarithms to the base 10, we get 
g=14(p—1) logy 2 
If we assume p = 24, then 
q= 1+ 23 logy, 2 = 7.9 
We may say that the computer can store numbers with seven significant 
decimal digits. 


ERROR PROPAGATION 


Numerical computing involves a series of computations consisting of 
basic arithmetic operations. Therefore, it is not the individual roundoff 
errors that are important but the final error on the result. Our major 
concern is how an error at one point in the process propagates and how 
it effects the final total error. In this section, we will discuss the arith- 
metic of error propagation and its effects. 


Addition and Subtraction 


Consider addition of two numbers, say, x and y. 
Xy + Yi = Xa + er + Ya tey 
= (xa + Yo) + (е, e) 
Therefore, 
Total error = e, 


ry =O, + ey 
Similarly, for subtraction 


Total error = е, 


=e 


Note that the addition е, + с, does not mean that error will increase in 
all cases. It depends on the sign of individual errors. Similar is the case 
with subtractions. 

Since we do not normally know the sign of errors, we can only esti- 
mate error bounds. That is, we can say that 

lessl < lel + 16,1 

Therefore, the rule for addition and subtraction is: the magnitude of 
the absolute error of a sum (or difference) is equal to or less than the sum 
of the magnitudes of the absolute errors of the operands. 

This inequality is called the triangle inequality. The equality applies 
when the operands have the same signs, and the inequality applies if 
the signs are different. 


Multiplication 


Here, we have 


хх у, = (б 6 x боже) = ху + Yale + хе, + её, 
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Errors are normally small and their products wil] be much smaller. 
Therefore, if we neglect the product. of the errors, we get 
Ke X Yi = хау, + ху + уе, 
= жу» + хау, (е,х„ + е,/у„) 


Тһеп, 

Total erroi +) 
Division 
We have 


х, _х,+е, 
Ye Ya +еу 


Multiplying both numerator and denominator by y, —e, and rearranging 
the terms, we get 


х, _FaYa t+ Vals ~taly C6, 


РД 


Dropping all terms that involve only product of errors, we have 


Zt х,у tYals Tue, 
m у2 


x, 
Total error =e,), = —- 


» 


Again applying the triangle inequality theorem, we have 


x 


(& 


eskan Í 


ey <|*- 
Ya 


Note 1 
8 initial errors е, and e, may be of any type. They may be 
1, empirical errors introduced in the measuring process 
2. roundoff errors introduced in conversion 
3. roundoff errors introduced due to arithmetic operations in the pre- 
vious step, if x, and y, represent some intermediate results 
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4. truncation errors, if x, and y, represent the result of evaluation of 
infinite series 

5. any combination of the above 
Note 2 
The final errors (after arithmetic operations) e, , „ e, _ „, e,, and ey are 
expressed in terms of only e, and е, and do not contain the roundoff 
errors introduced by the operations themselves. This results from the 
need to store the result in floating point representation. Therefore, we 
must add the roundoff error introduced in doing the operation in each 
case, For example, 

„у= Hey +e, 

Now, we can have relative errors for all the four operations as follows: 


Addition and Subtraction 


1е,1+1е,1 
€ xay 
Ix, 5781 
„| "NA 
gr (е, 1+ +7 le, 
Multiplication and Division 


Cray =le,, 21 *le,,l 


€ ay P ler sd le, vl 


Wate the relative error in 2 = x - y when x = 0.1234 x 10* and 
.1232 х 10* as stored in a system with four-digit manti: 


Bince the numbers x and y are stored in a four-digit mantissa system, 
they are properly rounded off and therefore, 


len el zn 10-5 = 0.05% 


le, | six 10-5 2 0.0566 


Then 
0.1234 x 10* x 0.5 x 1072 = 0.617 
= 0.1232 x 10 x 05 ~ 10° = 0.616 
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Therefore 
le, < lel 16,1 = 1233 
1:233 x10 * 


ГЭ] < 1234 0.1233] - 0.6165 = 61.65% 


Although the relative errors in x and y are very small, the relative error 
in z is very large. If we use this result as an input to further calcula- 
tions, the final result will be disastrous. The error due to subtraction of 
two nearly equal numbers is known as subtractive cancellation. 

Rules for error propagation discussed above can also be derived using 
the concepts of differential calculus. We will find this approach more 
convenient when we deal with complex functions. For example, consider 
a power function 


Relative error, 
Cp = x Axe =n x e, 


The relative error in w is n times the relative error in x. 


Sequénce of Computations 


have seen how errors in the operands propagate to the result of un 
operation. As we know, the computer can do only one operation at a 
time. It performs a sequence of operations in order to evaluate ёуеп a 
simple expression; such as 
wax ylz 
In such cases, the result of one operation is stored in the machine in the 
floating point form before it is used as an input for the next operation. 
At each stage of computation, a roundoff error is therefore introduced in 
the result before it is used again. Thus, each stage becomes a source of 
new errors. This is illustrated in Fig, 4.2, for evaluating the above ex- 
pression. The intermediate value u contains the propagated error due to 
error in x and its own roundoff error r}. Similarly, v contains the propa- 
gated error due to errors in y and z and also the roundoff error ry. 
Finally, w contains the propagated error due to errors in u and v and the 
roundoff error гу. 


Roundoff error 
^ 


Roundoff error 
Fig. 42 Block diagram for evaluation of x? + y/z 
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3.45, е, = 0.005 |3.45| = 0.01725 


€ = |х, еу + Yale 
„35 х 0.03370 + 6.74 х 0.01175 = 0.15839 
ey = leny] + le, = 0.15839 + 0.01725 = 0.17564 


Addition of a Chain of Numbers 


As we pointed out earlier, many standard mathematical ideas do not 
hold good in computer arithmetic, One such case is the floating point 
addition. In computer arithmetic, the floating point addition is not al- 
ways associative. That is, 


х+у+т=т+у+х 
The examples 4.11 and 4.12 illustrate this rule. 


Evaluate w = x + y + z, where x = 9678, у = 678 and г = 78. Remember, 
the computer performs arithmetic operations one at a time and from left 
to right. We assume that there is no inherent error (for the sake of 
simplicity) in x, у and z and the length of mantissa is four. 


и = 0.9678 x 10* + 0.0678 x 10! = 1.0356 х 10* 
ш = и +2 = 0.1035 x 10° + 78 
0.1035 х 10° + 0.00078 х 10° 


ш=и+х 
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= 0.0756 x 10* + 0.9678 x 10* = 1.0434 x 104 
= 0.1043 x 105 = 10430 


True ш = 10444 
e,=14 
емъ= 13х10 


Examples 4.11 and 4.12 show that the errors are not the same in both 
the cases. It also shows that the error is less when the numbers are 
arranged in the increasing order of their magnitude. See Example 4,13 
for a more general proof. 


pum |, 55 5 1 


Prove that the procedure w, = (y + z) + x is better than the procedure 
wg = (x+y) +2 1 


(а) Procedure w, 

Let и=х+у 

Тһеп, Res Os г 
же zry ety nitri 


where r, is the relative roundoff error introduced at this stage. 


u 2 
нан Ир ЖИА 
ате PS tes the 


ety! x 7 z 
Ages. ауе en fens + 
1 

= 516: жуе, у HZ, +(х®+у)гү +(х+у+г)л;] 
where S=x+y+z 
Now, let us use 

В = max (le, е, Мел.) R= max (лға) 

Then, we get 


ө grt» +2) ere 2y ez) Ry] 


ёш; = (к + y  z)R, + (Эх + 2y + z)R, 
If we further assume that R, and R; are only due to conversion, then 
R =R, = Ве, = (Зх + Зу + z)R 
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(b) Procedure w; = (y + z) + х 
Similarly we can show that Cw, = (82 + Зу + x)R 
Comparison 
£u, = (3x + Зу + ZR = (Зх + Зу + 32 – 22)R = (38 — 22)R. 
Cy, = (8х + Зу + 3z - 2x)R = (35 — 2x)R 
Since x >z, Cw < Ewy 
Therefore, the procedure w, = (y + z) + x gives better results than the 
procedure w; = (x + y) +2, 


Polynomial Functions 


Suppose we wish to evaluate a function Дх) where f is differentiable and 
the approximate value x, of x is given. In such cases, we can estimate 
the error bound in fix) using the mean-value theorem of calculus. 

According to this theorem, 

Ro) - flag) = (x ~ x f) 
where 6 is some value between x and x, and f' is the first derivative of 
the function f. Then the error in fix) is 
ej2 Ræ) - Rx] = le, FO 

Since the value of @is unknown, we take the maximum of f'() in the 

interval for estimating the bound for ер Then, 
ers e, max |/'(@)| 

This means that we have to evaluate the function /"(0) at various 
values of # and find the upper bound. This is sometimes a difficult task. 

Normally, the error e, is small and, therefore, we can make a reason- 
able approximation as follows 

er e li) 

Note that this e, does not include the errors that occur during the evalu- 
ation of the function itself due to conversion at various stages. 


Estimate the absolute and relative errors for the function 


e, = 0.0005 = 6 х 10-4 


Ра) el +1 Mea 
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fe) + +1= 125 


Then 
ер= 5 х 10* х 1.25 = 6.25 х 10% 


6.2510 + 
араса ЗЕ аи тара 


‘The mean-value theorem approach сап be readily extended to functions 
with more than one variable, using partial derivatives, For functions 
with two variables, x and y, we have 


ep le f, arta esf, Gara) 


where, апа/, denote partial derivatives with respect to x and y. 


у? for x = 3.00 and у = 4.00 


We assume that 
e, =e, = 0.005 


f(x, у) = 2x and /, (х, у) = 2y 


‘Therefore, 


ер= 2x ez + 2y e, 
x 3.00 + 2 х 4,00) x 5 x 10 


ONDITIONING AND STABILITY 


We know that uncertainties exist in all stages of numerical processing. 
ve discussed in detail how these uncertainties, particularly roundoff 
errors, are introduced at various stages and how they are propagated 
during the evaluation of an expression or implementation of a numerical 
method. Induced errors such as roundoff errors accumulate with the 
increasing number of computations in a process. There are situations 
where even a single operation may magnify the roundoff errors to a level 
that completely ruins the result. A computation process in which the 
cumulative effect of all input errors is grossly magnified is said to be 
numerically unstable. It is, therefore, important to understand the con- 
ditions under which the process is likely to be "sensitive" to input errors 
and becomes unstable. Investigations to sce how small changes (or per- 
turbations) in input parameters influence the output are termed as sen- 
sitivity analysis. 

Numerical instability may arise due to sensitivity inherent in the 
problem or sensitivity of the numerical method (or algorithm). This is 
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illustrated in Fig. 4,3. As we know, a mathematical model can be solved 
either by analytical methods or by numerical methods. In either case, 
when a small disturbance in an input parameter (known as inherent 
error) causes unacceptable amount of error in the output, we say that 
the problem is inherently unstable. Such problems are said to be ii- 
conditioned. When a problem itself is sensitive to small changes in its 
parameters, it is almost impossible to make a numerically stable method 
for its solution. 


Changes in 
Input data 


Fig. 4.3 Instability of numerical process 


The term “condition” is used to describe the sensitivity of problems or 
methods to uncertainty. Let us suppose we are evaluating a function Дх) 
and a small change in x produces a change in f(x). We can quantify the 
condition of this function by a number called condition number which is 
defined as follows: 
relative error in f(x) 

relative error in x 
The relative error in fix) is 


AP Paks 


pee 
“Тш fiz) 
The relative error in x is 


Condition number = 


| 
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Condition number = o 
fo 
The condition number provides a measure of extent to which an error in 
x is magnified in fix). If the condition number is large, then the function 
fix) is said to be ill-conditioned and its computation will be numerically 
unstable. There are different situations when a problem can have a 
large condition number. 
1. small f(x) compared to x and f'(x) 
2. large f(x) compared to x and fx) 
3. large x compared to f(x) and f'(x). 
When several parameters are involved, we may have instability with 
respect to some parameters and stability with respect to others. In such 
cases, we should use the partial derivatives to estimate the total change. 
That is, 


+ 


Show that the following system of equations is ill-conditioned for com- 
Mae point of intersection when m, and m, are nearly equal. 

у=түх+С, 

m + Cy 


Solving the equations for x and y we get 


. 01-6 
т-ту 
-С, 
ы сетш. 
Let us assume that С, = 7.00, С, = 3.00, т, = 2.00 and m; = 2.01. Then 
7-3 
*^goi-zoo ^ 109 


у = 2.00 x 400 + 7= 807 
Now, let us change the value of m; from 2.01 to 2.005. Then 
pU sc ЖИ 
© 2.005 = 2.00 ~ 
у = 2.00 x 800 + 7 = 1607 
It shows that a small change (0.25 per cent) in the parameter m. results 
in almost 100 per cent change in the values of x and y. Therefore, the 
problem is absolutely ill-conditioned. 


x 800 
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Compute and interpret the condition number for 


— 388 
Xx- 1) 
The function is numerically unstable for the values of x close to 1. 


Note that the term “ill-conditioned” is ill-defined. If we are to take float- 
ing point seriously then we should say “relatively small changes” and 
“relatively large changes”. 

1f the ill-conditioned effect is present in tho original physical system 
itself, then there is nothing that we can do to achieve numerical stabili- 
ty. In many instances, the ill-conditioning arises from mathematical 
formulation of the problem, In such cases, the instability may be re- 
moved by reformulating the mathematical models. For example, consid- 
er the quadratic equation 

ax + bx +e = 0. 
We know that the two roots are 


-b+ 4b? -4ac -b - yb? -4ас 
xy = 


За ш 2а 


xi 


When b° >> 4ac, Jb? —4ac will be very close to b and therefore, when 6 


is positive, the expression for x, may have the effect of subtractive can- 
cellation. Here, we can reformulate the formula for x, as follows: 


-b+ Jb? лас -b- (62 аас 
is ufi d a на es 


uu -b - Jb? - 4ac 


-2c 


| bab? - 4ac 
If b is negative, we must perform the same operation for x». 
Another approach to the same problem is to change the algorithm of 
calculating х; and ху. First find the larger root from the formula 


-b+ jb? -4ac 
2a 
and then find the smaller roat from the relation 
хуз, = cla 


x= 


^ Compute the difference of square roots of two numbers x = 497.0 and 
96.0. 


Assume x and y are exact. Assuming a mantissa length of 4, 
Ух = 4497.0 = 0.2229 x 10° 
Jy =V496.0 = 0.2227 х 10° 
z2dx-dy = 0.0002 x 10° = 0.02 
Let us try another approach by rearranging the terms as foliows: 


NE NES 
edges m 


ml 202244 x 10! = 0.02244 
0.4456 x 107 


The correct answer is 0.02244. This shows that by rearranging the terms 
we improve the result. 


г algorithm to calculate B is to find the factoriala n), (л — r) and 


r! and combine them to get B. That is, 
Е; 

СР, ХЕ, 

where P, = n!, Fy = (л — r)! and Fy =r! 

The problem with this algorithm is that when л is large, the factorial л! 

may be too large for the computer to store and thus, may result in 


overflow error. This problem can be overcome by modifying the algo- 
rithm as follows: 


B 


fi jeans 


= 


Ban for r-1 
This can be expressed recursively as 


r n+ 


i 


B-n 
This algorithm will compute B without causing an overflow error unless 
the final answer itself is too large. 
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Reformulate the following expressions to avoid loss of accuracy due to 
subtractive cancellation. 


(a) x - yx? – 1 for large x 


1-cos x 
(b) — 


for small x 


1-cosx 


(b. =- 
i sinx 


sin?x sinx 
inx(l*cosx) 1+ совх 


Куеп when the problem is formulated in a reasonable way and the input 
data is accurate, the method of solution may make the process unstable. 
For example, in a step-by-step algorithm where we use an interval Л to 
increment a variable, the error may increase if h is decreased (or in- 
creased beyond some limit). If such induced errors are large, then our 
method of solution may exhibit what is known as induced instability. 
Another example is the “pivoting” technique used in solving simulta- 
neous linear equations (see Chapter 7). Here, pivoting can make a well- 
conditioned system into an ill-conditioned one, if proper care is not tak- 
en in the design of algorithm. 


== 


Show that the series е" cie NR 


becomes unstable when x = —10. 


The series can be represented as 


А 
Six) = ÈT; + Te 
1=0 
where 
xi 
= 


Те is the truncation error. 

For —1 <x < 1, T; decreases as i increases, but for large values of |x|, T; 
will grow in magnitude until the factorial in the denominator dominates, 
when once again Т, will decrease in size. When x = —10, we have: 
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^ i 
Ñ -10 

Ty 50 

f. -166.66767 
Ta 416.66767 
T. —533.33333 
Pa 1388.8888... 


Assuming a six-digit mantissa machine, the roundoff errors in repre- 
senting th- large values of T, will be of greater magnitude than t!.o final 


''#- б 45 x 10% itself. Therefore, the roundoff error 42:7, i- 
yuted solution. The method becomes unstal!s, 


t т be overesn2 b; ga cimz 
known as the range reduction scheme for x. We know that 
Fae 
Thus, 
2719 = (е71)10 = ((е-05у3у10 


ЁК CONVERGENCE OF ITERATIVE PROCESSES 


As pointed out earlier, most of the numerical computing processes are 
iterative in nature. We start with an approximate value of the solution 
and compute iteratively the next approximate value till the difference 
between two consecutive values is negligible or within a specified limit. 
The number of iterations required to reach the given limit depends on 
the rate at which the iterates converge to the result. 

Suppose that x, i = 0, 1, 2,... is a sequence of iterates and x is the 
expected value of x. Let e, be the error in the iterate х, Then 


e-x-x  foreachi 
We would like the iterates to converge to x and this would happen if 


the numerical process is stable. The process is said to converge if there 
exists positive constants р and с such that 


lal 


nae (le ^ 


The constant p is known as the order of convergence and c is known as 
asymptotic convergence factor. This shows that if the error in x,,, is 
proportional to the pth power of the crror in x, (i.e. the previcus iterate), 
then the iterative method is said to be of order of p. It is ciear that the 
higher the order of iteration, more rapid is the rate of convergence. 

The rate of convergence is a measure of how fast the truncation error 
goes to zero. This measure is used for comparing various iterative 
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methods. The rate of convergence is expressed in different ways. For 
example, if the method converges like h?, the order of convergence is №, 
and so on. It means the value of p is 2. We shall consider the rate of 
convergence in detail when we discuss the iterative methods later. 


A ERROR ESTIMATION 


It is now clear that it is almost impossible to know the exact error in a 
computed result. Nevertheless, it is possible at least to have some esti- 
mate of the error in the final result. There are three approaches that are 
popularly used in error estimation: 
1. forward error analysis 
2. backward error analysis 
3. experimental error analysis 
In forward error analysis, we try to estimate error bounds in the 
computed result using information such as uncertainties in the input 
data and the nature and number of arithmetic operations involved in 
the computing process. We can estimate the contribution due to 
1. errors in the input data 
2. roundoff errors in arithmetic operations 
3. truncation of the iterative process 
4. errors in formulation of the model 
For example, we have seen in section 4.9 that the total error of a sum 
of three values is given by 
e, $ (x+y +2)R, + (2x + 2y + ZR 
where 
R, = тах (|е„|, leyl, 161) 
Ry = max (ril, Ira) 
‘This can be easily generalised for addition of n values: 
e, Sl, + х; +... + х„)}Ву + |(п— 1 + {n - 1x 
+ (n= 2)ху + + 2х„_у+х„]В) 


x 2-1 
=R x; +R fin- Dx, tE tanin] 
iz L 11 


Similarly, we can estimate bounds for product of n numbers. 

Error estimated through forward analysis is always pessimistic and is 
often much higher than the actual error. 

In backward error analysis, we try to show that the computed results 
satisfy the problem within the given bounds. For example, we can put 
back the roots computed in the equation and see to what extent they 
satisfy the original equation. By comparison, we can then decide on how 
much confidence we can place in the computed results. Backward analy- 
sis is usually easier to perform than forward error analysis. 
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Experimental error analysis involves a series of experiments by using 
different methods and step sizes and then comparing the results. We 
may also perform sensitivity analysis to see how any change in parame- 
ters affects the result. 

When the application is very critical in nature (such as space and 
defence applications) the problem may be solved by more than two inde- 
pendent specialists groups and the results can be compared. 


aay] MINIMISING THE TOTAL ERROR 


Assutning that the mathematical model has been properly formulated 
ана the input data аге accurate, the total numerical error primarily 

consists of two components, namely, truncation and roundoff errors. Any 
effort to minimise the total error should, therefore, be concentrated on 
the ways to reduce these two types of errors. The steps may include: 

1. increasing the significant figures of the computer 

2. minimising the number of arithmetic operations 

3. avoiding subtractive cancellations 

4, choosing proper initial parameters 

In many iterative processes such as numerical integration, it is possi- 
ble to minimise the truncation error by decreasing the step size. But this 
would necessarily increase the number of iterations and thereby, arith- 
metic operations. This would certainly increase the roundoff error, This 
phenomenon is illustrated in Fig.-4.4. We must, therefore, judiciously 
choose a step size that would minimise the sum of these errors, 


Total error 


k A Truncation error 
-] 5s ee 
— сы. 


Roundott error 


——- 
Step size 


Fig. 44 Dependence of error cn step size 


Eua] PITFALLS AND PRECAUTIONS 


We have seen that the floating point arithmetic system is full of pitfalls 
such as conversion, roundoff, overflow and underflow errors. In many 
cases, we may have to consider some precaution techniques to get the 
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most accurate results. The type of precaution techniques that might be 
used depends both on the computer hardware and the nature of the 
mathematical models. Here are some hints that might help improve the 
accuracy of the results. 
1. Rearrange the formula so that you can avoid subtraction of two 
nearly equal numbers. For example, 


х®-у? 
х-у 
сап be replaced by 
x+y 


when x and y are nearly equal. 

2. If necessary, use double precision for floating point calculations. 
This would improve the accuracy considerably but would take more 
execution time and computer memory space. 

3. Rearrange your formula to reduce the number of arithmetic opera- 
tions, An example is evaluation of a polynomial. The polynomial 

ах" +, ax" +... +00 
may be rearranged as 
(„бах +а„ ух +а„ү}х.. + ao) 
‘This requires much less arithmetic operations. 

4. When finding the sum of set of numbers, arrange the set so that 
they are in the ascending order of absolute value. ‘That is, when 
la] > |b] > [c], then (c - b) + a is better than (a — b) + с. 

5. Wherever possible, rearrange your formula so that you use the 

original data rather than derived data. 

. Do not test a floating point number for zero in your algorithm. 

‚ Wherever possible, use integer arithmetic to avoid conversion and 

roundoff errors. 

8. Avoid multiplication of large numbers that may lead to overflow. 

9. Use alternative arithmetic such as interval arithmetic, if necessary. 


БШ SUMMARY 


In this chapter, we studied various types of errors and how they can 

affect numerical calculations. We considered, in particular, the following: 
e concept of significant digits and its relation to accuracy and preci- 

sion of numbers 

inherent errors that are present in input data 

procedural errors introduced during the process of computing 

• modelling errors that arise due to certain simplifying assumptions 
in the formulation of mathematical models 

importance of absolute and relative errors and their relation to the 
machine epsilon 


no 
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* propagation of errors during computing and how it affects the result. 

+ causes of numerical instability and how to overcome instability 
problems 

* convergence of iterative processes 

* estimation of errors and some steps that might help to reduce the 
final error 


Absolute error Input error 
Accuracy Machine epsilon 
Algorithm Mean-value theorem 
Asymptotic convargence factor Modelling error 
Backward error analysis Numerical error 
Blunder Numerical instability 
Chopping Numerically unstable 
Condition number Order of convergence 
Conditioning Perturbations 
Convergence Precision 
Conversion error Procedural error 
Data error Rate of convergence 
Discretisation error Relative error 

Drag coefficient Representation error 
Empirical error Rounding 

Error propagation Roundoff error 
Experimental error analysis Sensitivity analysis 
Forward error analysis Significant digits 
Нитап error Stability 
Ul-conditioned problem Subiraclive cancellation 
Induced errors Symmetric rounding 
Induced instability Triangle inequality 
inherent error Truncating 
inherently unstable Truncation error 


1. Why is the study of errors important to a computational scientist? 
2. Explain the concept of significant digits. 
3. Describe the relationship between significant digits and the follow- 
ing: 
(a) round-off errors 
(b) accuracy 
(c) precision 
What are inherent errors? How do they arise? 
^ Distinguish between roundoff errors and truncation errors. 
6. What is chopping? When does it occur? 
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7. What is symmetric round-off? Show that the symmetric error is, at 
worst, one-half the chopping error. 
B How does a truncation error occur? Give two examples. 
9. How do mathematical models contribute to errors in numerical 
computing? 
What are blunders? How can we minimize them? 
TN ‘What do you mean by relative error? How is it important in error 
analysis? 
12. What is machine epsilon? How is it related to significant digits? 
13. State and explain triangular inequality as applied to error propa- 
gation. 
4, What is subtractive cancellation? How does its presence affect the 
result of a computation? 
5. Define condition number. What is its significance to numerical com- 
puting? 
16. What is range reduction technique? Give an example of ils applica- 
tion. 
17. How will you decide the convergence of an iterative process? 
18. Explain briefly the three approaches used in error analysis. 
19. In an iterative process, how does step size affect the total error? 
20. Enumerate a few precautionary steps that might help improve the 
accuracy of numerical computing. 


1. Find the accuracy and precision of the following numbers: 


(a) 12.345 (d) 750 
(b) 0.0002932 (e) 750.5 
(c) 0.0029320 (D -68.3705 


2. Add the decimal numbers 0.4 and 0.65 in binary form using 6 
binary digits and then estimate the error in the sum. Show that the 
error can be reduced by using more binary digits to represent. the 
numbers. - 

3. Find the round-off error in the results of the following arithmetic 
operations, using four digit mantissa. 

(a) 277.65 + 22.20 
(b) 87.26 + 31.42 
(c) 1250.0 x 40.0 
(d) 3543.0 x 16.78 
(e) 25.68 « 6.567 
(f) 456.7 — 1.531 
(g) 456.7 ~ 4.566 

4. Calculate absolute and relative errors in the arithmetic operations 

in Exercise 3. 


ES 
x 
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10. 


- Estimate the relative error of the final result in evaluation of 


(a) w, = (x + yk and 
(b) wy = х? + ylz 
Given that x = 1.2, у = 25.6 and z = 4.5. 


. Find the absolute and relative errors in evaluating the following 


expressions: 
(а) yx? +y? 
(Ы) xe” 
Assume x = 1.25 and у = 2.16. 


. Find out which procedure (р; or p;) produces better results: 


(а) ру = хіх + 2), р» =x" + 2x 
(b) ру = (х + 1) (x + 2), ра = +3) +0 


. Determine the condition of the following functions: 


(а) ftx) = sin(x) 
(b) Ах) = 10 -2) 
(с) fix) = xë 

(d) fa) = х! 


. Rearrange the following expression to avoid loss of accuracy due to 


subtractive cancellation: 
(а) cos x — sin x for x close to 45* 


(b) J1«x - /1-x for small x 
(c) 1- cos x for small x 
(d) dx? +1-х for large x . 
(e) Ini + 1) - In) for large x 
Estimate the maximum error in evaluating the expression 
a? - 25x? + 3.1 — 1.5 at x = 125 
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[Бр] NEED AND SCOPE 


After a sound algorithm and a detailed flow chart comes the develop- 
ment of computer program, known as coding. Codes are written in a 
high-level computer language. Hundreds of high-level languages have 
been developed during the last four decades. Among these, a few have 
direct relevance to numerical computing. They include, among others, 
BASIC, FORTRAN, C, and C++. 

FORTRAN, which stands for FORmula TRANslation, was the earliest 
scientific language developed in the 1950s. Since it was specially designed 
for mathematical computations, it has been the most widely used language 
for scientific and engineering applications. It is well suited for 
implementing the numerical methods discussed in this book. іп spite of 
development of numerous other languages, FORTRAN continues to play 
а dominant role in engineering applications. Consequently, we are going 
to use FORTRAN for developing programs for implementing our 
algorithms. Our programs and algorithms are concise and general enough 
to be used as the basics for developing programs in other languages, if 
necessary. 

A complete description of FORTRAN 77 is beyond the scope of this 
book. We only give here an overview of the language. However, enough 
material has been included so that the reader can easily understand the 
programs given in the book and also modify and implement them 
effectively. Wherever necessary, FORTRAN 90 features are also included. 


[БУЗ A SAMPLE PROGRAM 


For solving any problem in FORTRAN, we have to write a sequence of 
instructions using certain statements known as FORTRAN statements. 
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These instructions are required by the computer to perform the follow- 


ing tasks; 
1, get data into the computer memory 
2. perform arithmetic and logical operations on data 
3. provide results on an output media 


Program 5.1 


* Main program 
ж A program to evaluate a function at different 
points 


ions invoked 


Subroutines used 


* Variables used 

* X - Independent variable 
* Р ~ Function value 
* 

D 

* 

. 

B 


COUNT — Counter to store number of evaluations 


Constants used 


REAL X, F 

INTEGER COUNT, N 

PARAMETER( N = 5 ) 

WRITE(*, *) 'Input value of X' 
READ(*, *) X 


WRITE(*, *) ' OUTPUT OF SAMPLE РВОСКАМ'' 
WRITE(*, *) * x F á 
COUNT = 0 

100 Fex*x 
WRITE(*, *) X, F 
Ru ke xk 


COUNT = COUNT + 1 
IF( COUNT .LT. N ) GO TO 100 
STOP 
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А sample FORTRAN 77 program to evaluate the function fix) = x? for n 
values of x is shown in Program 5.1. When we run this program, it 
displays first the following message: 
Input Value of X 

and then waits for the input from the keyboard. Let us enter a real 
value, say 1.0 and then press the RETURN key. Execution now continues 
and produces the following output on the scrcen: 
Input value of X 
1.0 

OUTPUT OF SAMPLE PROGRAM 


x 
1.0000000 
2.0000000 
4.0000000 16.0000000 
8.0000000 64. 0000000 
16.0000000 256.0000000 
Stop — Program terminated. 


Program 5.1 illustrates some of the FORTRAN statements and the overall 
format of a FORTRAN 77 program. This program is intended to give 
only an overview of a FORTRAN ‘program. The details of FORTRAN 
features will be discussed in the sections to follow. 

The first line of Program 5.1 is a FORTRAN statement known as 
program unit header or program statement. This statement is not essential 
in all systems, You must consult the system manual before using it. 


[This is а recommended style in FORTRAN 90. | 


The lines starting with * or C in the first column are known as comment 
lines (only C in the FORTRAN IV version). These lines are used to insert 
explanatory remarks to help readers to understand the program. They 
are not instructions to the computer and, therefore, they are ignored by 
the compiler. Comment lines should be used liberally to explain various 
aspects within the program. 

[ FORTRAN 90 permits the use of the character T in 


the first column to mark a comment line. This can 
also be used as an in-line comment. 


L 
The next two lines 
REAL X, F 
INTEGER COUNT, N 
declare the types of storage associated with the variables. That is, the 


variables Х and F are declared as type real and COUNT and N as type 
integer. These statements arc called type declaration statements. 


In FORTRAN 90, they are written as REAL:: X, F 
and INTEGER:: COUNT, N. 
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The identifier N represents the number of points at which the function 
is evaluated. The value of N is going to be constant throughout the 
program execution. Such identifiers are known as symbolic constants or 
named constants. Symbolic constants may be given values using a PA- 
RAMETER statement. (Some version of FORTRAN 77 may not include 
the feature of PARAMETER statement). The value of a symbolic con- 
stant cannot be changed during execution. 

Some variables need to be given initial values like 

COUNT - 0 
before they are used in any expression. The process of setting variables 
to initial values is known as initialisation. 

"The set of statements 

PRINT *, ‘Input Value of X' 


IF (COUNT .LT. N) GOTO 100 
is known as processing block. It includes all executable statements such 
as input/output statements (READ, WRITE), assignment statements and 
control statements (such as If). Note that the value of х? is evaluated 
and assigned to the variable F in this block. Similarly, the variables 
coun? and Х are incremented in this block. The statement 

IF (COUNT .LT. М) GOTO 100 
is known as a control statement. 

‘This statement is responsible for creating a loop of operations and 
thereby making the function evaluated exactly N times. This is done 
with the help of the variable COUNT, usually known as a counter, which 
keeps counting the number of times the function has been evaluated. 

Note that the statement 

сото 100 
directs the control to the statement 

100 = Хх" X 
‘The number 100 is known as statement number or statement label. We 
need to use labels only to those statements to which the control is 
transferred from another part of the same program. 

‘The last statement in our sample program is the END statement. This 
statement (which is a must in every FORTRAN program) serves two 
purposes: 

1. it marks the end of source code during compilation 

2. it terminates the execution of the program 


In earlier versions of FORTRAN, we need to use two 
statements: 

STOP — to stop the execution of the program 

END — to mark the physical end of the program. 
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FORTRAN 90 implements the same as follows: 
END PROGRAM SAMPLE 


Note the structure of the sample program. A FORTRAN program 
generally consists of a series of blocks of code in the following order: 
Program name 
Program description 
Variable declaration 
Initialisation of symbolic constants 
Initialisation of variables 
Executable statements 
The END statement 
FORTRAN requires certain coding formats to be followed. Table 5.1 lists 
them. 


Table 5.1 FORTRAN 77 line format 


Columns Use 


I For typing the comment character. 
1-5 For typing statement number. 
6 For typing a non-zero FORTRAN character to indicate 
that the previous statement is continued. 
7-72 For typing FORTRAN statement. The statement сап 


begin anywhere in the region. 
Not used (or used for typing line number), 


FORTRAN supports the following major programming elements that 
have direct relevance to numerical computing discussed in this book. 

1. constants 

2. variables 


3. input-output instructions 

4. computational instructions 

5. control instructions 

6, documentation remarks 

7. subprograms 
The sample program has illustrated the use of all the first six elements. 
We shall discuss further details about them as well as the last element 
in this chapter. 


pea FORTRAN CONSTANTS 


Constants are the means by which numbers and characters are 
represented in a program. They are quantities that do not change. 
FORTRAN supports the following five built-in data types: 

1. Integer type 

2. Real type 

8. Complex type 
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4. Logical type 

5. Character type 

Integer constants are numbers that do not contain decimal points (i.e. 
whole numbers). They can be positive, negative or zero. For examples 

25 -10 0 +123 

Real constants are numbers containing decimal points. They may be 

expressed in positional form or exponential form. Examples: 
12.5 -1.756 0.0 5. (Positional form) 
.23 Е+09 15ЕЗ -2.3E -5 (Exponential form) 

The exponential form (also known as scientific form or floating point 
form) is used where yery large or very small numbers are to be written 
but not all digits need to be represented. (Number of significant digits 
depends on the computer.) 

Complex constants are ordered pairs of real numbers, separated by 
commas and enclosed in parentheses, like (a, b). Examples; 


(3.0, 4.0) (-1.0, 0.9252) (1.2 E-2, 4.1 Е1) 


The first number is called the real part and the second is called the 
imaginary part of the complex number. (Complex numbers are usually 
written in a + jb notation in mathematics.) 

Logical constants are data that are used to represent the two truth 
values "true" and “false”, Therefore, there aro only two logical constants 
which are written as 

= TRUE. 
«FALSE. 

Character constants represent a string of characters enclosed in apos- 

trophes (single-quotes). Examples: А 
‘John’ ‘January 26, = ‘NEW DELHI 20, 123, 


In FORTRAN 90, we may also use double quotes, 


eee FORTRAN VARIABLES 


Variables represent quantities that can change in value. In FORTRAN, 
they indicate storage locations where the values are stored. These values 
can be changed whenever required. 

A variable name may consist of one to six characters, chosen from the 
letters A through Z and 0 through 9, the first of which must be a letter. 
Examples; 

ALPHA Xi SUM NAME 


| FORTRAN 90 permits names with a length of 31 


characters and also allows the use of underscore character. 
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We can have longer names in FORTRAN 90. Examples: 
DISTANCE_TRAVELLED AVERAGE_HEIGHT 
This facilitates to create more meaningful names. 


All variables must be declared for their storage types corresponding 
to the five data types discussed in section 5.3, namely, integer, real, 
complex, logical and character. Examples: 

REAL NUMBER, SUM, X1 

INTEGER TOTAL, COUNT, Ү 

COMPLEX ЕООТІ, Z 

LOGICAL PACKED, L 

CHARACTER * 20 NAME, СІТҮ 
The variables NAME and СІТУ can hold up to 20 characters. We can also 
increase the number of significant digits held in a real type variable by 
declaring it a “double precision" variable as follows: 


DOUBLE PRECISION SUM, Xi 


| Declaring a variable creates a storage location of 

appropriate type but it does not store any initial value. It 
| contains some unknown bit pattern stored previously, i.e. 
| the variable contains garbage. 


Any variable that is not declared explicitly for its type assumes default 
(implicit) type as follows: 


Names beginning with Type 


~ Any letter I through N — Integer 
Any other letter Real 


E SUBSCRIPTED VARIABLES 


FORTRAN variables can have subscripts to store a set of related values 
in one-dimensional vector or multidimensional matrices. A subscripted 
variable is called an array. 

An array can be used to represent a collection of data of the same iype 
and the subscripts can be used to access the individual data items. For 
instance, the third element of a one-dimensional array X is given by 
X(3). Examples of array variables are: 

NAME(2, 10) CITY (5) GRADE (I) 
We can use integer variables to represent subscripts and by assigning a 
suitable value to the subscript variables, we can access the desired 


element of the array. 
All array variables must be declared for their type and size. Example: 


100 Numerical Methods 


REAL X(10) 
or 
REAL X(1:10) 
Both these statements declare X as a one-dimensional array with elements 
numbered 1 to 10, which are of type real. The second form specifies the 
lower and upper bound of the subscript. In this form, the value of either 
bound may be positive, negative, or zero. The value of the upper bound 
should be greater than the value of the lower bound. Examples: 
INTEGER X(0:5), M(- 20), N(-5:0) 
REAL P(5,5), VALI 3:3,5) 
The second line declares Р and VALUE as real type, two-dimensional 
arrays. 
We may also declare type and size in separate statements like 
REAL Х, М 
DIMENSION Х(10), М(0: 10,10, 0:20) 
Character arrays are declared as follows: 


CHARACTER * 30 NAME(40) 


or 

CHARACTER * 30 МАМЕ (1:40) 
where the number 30 specifies the maximum number of characters to be 
stored in an array element. 


In FORTRAN 90 arrays may be declared as follows: 
REAL, DIMENSION(1:9) Y 

CHARACTER(LEN - 
LOGICAL (-5:5) FOUND 
CHARACTER, DIMENSION(!0) 


Е] INPUT/OUTPUT STATEMENTS 


Input/output statements are data transfer statements that are required 
in every program. FORTRAN supports two kinds of I/O statements 

1. list-directed VO statements 

2. format directed I/O statements 
We have already seen (in sample program 5.1) the use of list directed ПО 
statements. They are 


NAME 


READ *, X 
PRINT *, F 
The general form of these statements are: 
VEND ар Чү. Wü a wy MR 
PRINT *, ү. ч, 


Bos eso Ча 
where v, Vo, ...‚ v, are data items. READ * reads input data from the 
standard input device, usually the keyboard, and assigns them to the 
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READ *, A, B, COUNT 
PRINT *, X, ‘TOTAL’, SUM, 40.75 
READ * is usually used to provide input data interactively through the 
keyboard. The data should be entered with either a comma or one or 
more spaces between the items, 
Format directed UO statements are used when the data should be 
read or written using a specified format. The general form of format 
directed 1/О statements are 


READ(n,, n;) Vi, Vz, ... 
WRITE, nj у, v, 


where n, is the number assigned to the device giving input or receiving 
output and г, is the number of the FORMAT statement which specifies 
tho format of input data or output values. The FORMAT statement (a 
non-executable statement) takes the following form: 
72 FORMAT (list of specifications separated by commas) 
Examples: 
READ(S, 100) X, у (Reads values from unit 5) 
WRITE(6, 200) X, Y, SUM (Writes values to unit 6) 
The unit may refer to keyboard, screen, printer, disk drive, and so on. If 
же are using only the standard devices as specified by the computer 
system, then we can use the following forms; 
READ(* ,.100) x, v 
WRITE(* , 200) X, y, SUM 
The FORMAT statements may look like 
100 FORMAT (15, FS.2) 
200 FORMAT (110, F7.2, F10.2) 

The letter 1 indicates that the number to be handled is integer and F 
indicates that the number is floating point type. For more details about 
format specifications, you must consult the manual. 

We may also give initial values to variables using the DATA state- 
ment as follows: 

DATA X, Y / 25, 7.25 у 

This statement assigns 25 to X and 7.25 to v. 


BS COMPUTATIONS 


FORTRAN was specially designed to evaluate complex mathematical 
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expressions. We can write а FORTRAN expression for a given 
mathematical expression and assign it {о a variable using an assignment 
statement as follows: 
у = expression 

This statement directs the computer to replace the previous value of the 
variable on the left-hand side of the equals sign with the result of the 
expression on the right. The expressions are written using variables, 
constants and arithmetic operators (see examples shown in Table 5.2). 


Table 5.2 FORTRAN expressions 


Algebraic expression FORTRAN expression 

а=х+®-г% лә», YZ-R**2 
z 

pa ag B= (х + уа + КТ 
2 


= (ху) (z + 2) C « (x * Y) * (Z 4 2.0) 


The following are the accepted arithmetic operators in FORTRAN: 
+ Addition 
— Subtraction or Unary minus 
/ Division 
* Multiplication 
** Exponentation 
According to the precedence rule 
1. all exponents are performed first, all multiplications and divisions 
next, and all additions and subtractions last, 
2. for the same precedence, the operations are performed from left to 
right, and 
3.,when parentheses are used, the expressions are evaluated from 
innermost to outermost parentheses (using the same precedence 
rule each time) 
In numerical computations, we often come across an assignment 
statement of the type 
SUM = SUM + N 


This means, replace the “old value" of SUM by the “new value”, 


Mixed-Mode Expressions 

It is possible to combine integer, real and double precision quantities 
using these arithmetic operations. Expressions involving different types 
of numeric operands are called mixed-mode expressions and are evaluated 
as shown in Table 5.3, 
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Table 5.3 Mixed-Mode Evaluation 


Mixed-mode expression Evaluation Result type 
integer op real Convert the integer to the corre- Real 
sponding real value and evaluate the 
expression. 
integer op double Convert the integer to the corre- Double precision 
precision sponding double precision value and 


evaluate the expression. 
real op double Extend the real to a double precision 
precision value (by adding zeros) and evaluate 
{һе expression. 


КЕЕ CONTROL OF EXECUTION 


Control of execution means the transfer of execution from one point to 
another in the same program, depending on the conditions of certain 
variables. This may involve a forward jump thus skipping a block of 
statements, or a backward jump thus repeating the execution of a block 
of statements. This is known as conditional execution of statements 
Examples of such conditional execution are: 

1. If the value is negative, skip the following four statements. 

2. If the item is the last one, go to the end. 

3, Execute the following ten lines 100 times. 

4. Evaluate the following statement until a given condition is satis- 

fied. 

FORTRAN contains two central structures which could be used to 
implement such conditional execution of statements. They are 

1. IF-ELSE structure 

2. DO-WHILE structure 


Block IF-ELSE Structure 


The block IF-ELSE structure (also known as selection structure) consists 
of a logical expression that teste for a condition or a relation followed by 
two alternative paths for the execution to follow. Depending on the test. 
results, one of the paths is executed and the other is skipped. This is 


Double precision 


Enter Enter 


] [ Bicek 2] [Block 1] 
Continue Continue 
(a) Statements in both paths (b) Statement in only one path 


Fig. 5.1 Flow chart of IF-ELSE structure 
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illustrated in Fig. 5.1. 
The FORTRAN statement to code a block IF-ELSE structure takes the 
form: 


IF (logical expression) THEN- 
statement-block 1 

ELSE 

statement-block 2 


IF block 
ELSE block 


The statement blocks may contain zero or more statements. If the logical 
expression is true, the program executes statement-block 1 and then 
goes to the statement next to the END IF statement; if the logical 
expression is false, the program executes statement-block 2 (skipping 
statement-block 1) and then ges to the. statement next to the END IF. 


Relational Expressions 


Relational expressions are meant for comparing the values of two 
arithmetic expressions and have logical values .TRUE. or .FALSE. as 
results. Arithmetic expressions may contain single variable, simple 
constant, intrinsic function, or a complex expression. In numerical 
computing, we often угат. our programs to test for certain relationships 
and make decisions based on the outcomes. We may use the relational 
operators given in Table 5.4 for comparing the expressions. 


Table 5.4 Relational operators 


Meaning 


Less than 

Less than or equal to 
Equal to 

Not equal to 

Greater than 

Greater than or equal to 


Examples of rational operators are 
1. IF(X .LT. Ү) THEN 
PRINT * ‘Small is’, X 
ELSE 
PRINT * ‘Small is’, Ү 
END IF 
2. IF(TOTAL .СТ. 1000) THEN 
TAX = 0.15 * TOTAL 
ELSE 
TAX = 0.10 * TOTAL 
END IF 
PRINT * 'GRAND TOTAL = ^, TOTAL + TAX 
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3, IF(C - D .GE. A - B) THEN 


X20C-D 
ELSE 

Х= А-В 
END IF 


When arithmetic expressions are used along with 
the relational operators, arithmetic expressions are 
| evaluated first and then the results are compared. 


Logical Expressions 


In some cases, we may need to make more than one comparison, It is 
Possible to combine two relational expressions using the following logical 
operators: 


-AND. Both relations are true 
„Ок. Опе ог both of the relations are true 
.NOT. Opposite is true 


Such expressions ure known as logical expressions. 
Examples of logical expressions аге: 


1. IF(SUM .GT. 100 .OR. N .GT. 20) THEN 
ELSE 


END IF 
2, IF(AGE .LT. 30 .AND. DEGREE -EQ. 'ME') THEN 


ELSE 


END IF 


FORTRAN permits nesting of IF-ELSE blocks. That is, we can place 
an IF-THEN-ELSE code within an IF block or ELSE block. 


Warning! 
Be careful when comparing real values. They are never 
exact! 


We may also use the following relational operators in 
FORTRAN 90. 
< Less than 
Less than or equal to 
Equal to 
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/= Not equal to 
>= Greater than or equal to 
> Greater than 


DO-WHILE Structure 


The DO-WHILE structure (also known as looping structure) performs a 
set of operations repeatedly while a certain condition is true. When the 
condition is not true, the repetition ceases. This kind of structure is 
implemented in FORTRAN by the DO statement. The general format of 
DO statement is: 


DO n i = е, е, е, 


> - —— — Body of the loop 
n CONTINUE 


n number of the last statement in the loop 
i loop control variable 

e, initial value of the control variable 

e, final value of the control variable 

e, increment value. 

The control variable i may be a real or integer variable. The parame- 
ters €}, e, and e, may be real or integer variables (or expressions or 
constants). 

The default value of c; is 1. The logic of DO loop is as follows: 

1. initialise the loop control variable to the initial value e, 

2. test to see if the value of loop control variable is less than or equal 

to the final value e;. If it is true, continue the loop; otherwise exit 
the loop 
. execute the body of the loop 
. increment the loop control variable by е; 
. go back to step 2 (beginning of the loop) 
This can be written in pseudocode form as follows: 
ї = е, 
DO WHILE i <= e; 
execute statements 
iei+e 
END DO 

Figure 5.2 shows a flow chart showing the execution of the DO struc- 
ture. The number of times the loop is executed (unless terminated by an 
EXIT statement) is given by the formula 


оф 


[x] denotes the greatest integer less than or equal to x. 
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Fig. 2 foe оис tortie DO lop 


Examples of DO loop are 
1. DO 10 Р = x/Y, 


20 CONTINUE 


4, DO 30 N = 2, 20 


30 CONTINUE 
4. DO 40 J - 1, 100 


IP (...) GOTO 50 (Exit from the loop) 
40 CONTINUE 


| ~ Warning 7 

| Avoid the use of real variahles fur DO loop parameters. They 
cause roundoff errors and, therefore, cannot always guarantee 

| the correct number of loop executions, 


А DO loop can contain DO loops within its range. This is known as 
nesting. When nesting DO loops, the inner loop must be entirely contained 
within the range of the outer loop. 
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Examples of nesting DO loops are 


1. DO 200 І = 1, 10 ————7]outer loop 
DO 1003 = 1, 5 —4 
PRINT *, I * J þef inner loop 
100 CONTINUE —— M 


200 CONTINUE ——— 


DO 500 I = 1, 10 ——— —4 
bo 200 J = 1, 10—4 jouter loop 


200 CONTINUE ~ 


inner loops 


300 CONTINUE 
500 CONTINUE 


The general form of DO structure in FORTRAN 90 is: 


Do loop control 


block of statements | 

END DO x | 
This is implemented in two forms: 
Form 1 

DO i = е, & е, 

END DO 
Form 2 
1. р 


(...) EXIT —- 
END DO Leave the loop 
© Go to the beginning 
IF (...) CYCLE 


— ) 
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SUBPROGRAMS 


One of the features of any modern programming language is the provision 
for subprograms. A subprogram is a separate program unit that can be 
called into operation by other programs. Subprograms are heavily used 
in numerical computing for tasks such as evaluation of a function, matrix 
multiplication, sorting, reading a table of values, printing a report, etc. 

The concept of subprograms allows us to break a complex problem 
into subtasks so that we may develop subprograms and later integrate 
them into a single program known as driver or main program. These 
subprograms can be independently designed, coded, and tested, 
Subprograms are usually called modules and the programming approach 
using modules is called modular programming. 

FORTRAN supports two kinds of subprograms, namely, functions and 
subroutines. A function subprogram returns a single value to the calling 
program while a subroutine subprogram can compute and return several 
values. 


Function Subprograms 


A function subprogram (or simply a function) is an independent program 
unit written to compute and return a single value. It takes the following: 


ON name (ary 


| arsi) 
| Declaration of argument types | 
| 


| 
SS as Execution statements 


name - expression | 


END | 


where гуре specifies the type of the function value that is being returned 
and arguments are dummy variables that must be declared for their 
type inside the function. They may vary in number from zero to many. 
There should be at least one statement of the form 


name = expression 
which assigns a value of appropriate type to the function name, which is 


in turn returned to the calling program. 
A function can be called as follows: 


variable name (arguments) 


When the function is called, the values of the arguments in the calling 
statement are assigned to the corresponding arguments in the function 
header. The arguments, therefore, must agree in order, number and 
type, An argument may be a variable name, an array name, or a 
subprogram name. Example: 
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PROGRAM MAIN 


REAL A, В, В; МЛ, -«— — MUL declared in main 
READ * A, B 

R = MUL(A, В) -———— Calling MUL 

PRINT * , R 

END 

REAL FUNCTION MUL(X, Y) 4———- MUL defined 
REAL X, Y 

MUL = xX * ¥ 

RETURN 

END 


When an array is passed as an argument, then its corresponding dummy 
argument should be an array variable and its size must be declared 
properly. Note that a function may be called and used in an expression, 
like any other variable. Example: 


R = A * MUL (А, В) 


Subroutine Subprogram 

А subroutine, unlike a function which always returns only one value, 
can return many values (or no values). Therefore, we use a subroutine 
when either several values are to be computed and returned or no values 
are to be returned (such as printing the values of some variables). The 
general structure of a subroutine is: 


SUBROUTINE rame (arguments) 
Declaration of arguments 


Execution statements 


RETURN 
END 
where name is the subroutine name and arguments are dummy variables 
that must be declared for their type. When subroutine has no arguments, 
the parentheses are omitted (note that in case of function, parentheses 
are necessary even if there are no arguments). The outputs of subroutine 
are returned to the calling program by means of the arguments. 
A subroutine can be invoked using the CALL statement as follows: 
CALL name (arguments) 
or 

CALL name 
The actual arguments in the calling statement must agree in a one-to- 
опе manner with the arder and type of the arguments in the subroutine. 
Example: 
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PROGRAM MAIN 
REAL A, B, R 
READ * , A, B 
CALL MUL (A, B, R) 


ME ч 


SUBROUTINE MUL(X, Y, ХҮ) 
REAL X, Y, XY 


XY-X*Y 
RETURN 
END 


The calling program assigns the values of A and B to the variables X 
and Ү in the subroutine which in turn assigns the value of Xv (computed 
in the subroutine) to the variable R. Compare this with the function 
subprogram. 

Note that the variables that are not passed as arguments may be 
passed to the subroutine using a COMMON statement, 


[ FORTRAN 90 greatly extends the power of function subprograms 
by allowing the result to be an array or structure. Function 
subprograms are designed as follows: 


FUNCTION name(argunents) RESULT (result - 
variable) 
Declaration of arguments and result-variable 


result-variable - expression 
END FUNCTION name 


Instead of function name, the result-variable is assigned the value 
that is to be returned to the calling function. The result-variable is 
а variable name that has been placed like a function argument 
with the RESULT keyword, immediately after the function name. 
Both the arguments and the result-variable are declared for their 
types. 

Note that, in FORTRAN 90, all programs and subprograms use 
the name of the program or subprogram in the END statement as 
follows: 

END FUNCTION F 

END SUBROUTINE SWAP 

END PROGRAM SORT 
FORTRAN 90 also includes features such as optional arguments, 
keyword-identified arguments and array sizes which are very 
powerful compared to FORTRAN 77. These features must be used 
wherever possible. j 
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FORTRAN allows us to write out a formula for a function and define it. 
using the assignment statement inside the program itself (instead of 
using an "external" function subprogram). Since such functions are "one- 
line” functions, they are called statement functions. A statement function 
is defined as follows: 


Function-name (arguments) = expression| 


where expression is the FORTRAN expression of the formula (or function) 
to be evaluated and argumenis is a list of variables used in the expression. 
The arguments are simple integer or real variables. Examples: 
AREA (R) = 3.1416 *R*R 
VALUE (P, R) & P* (1.0 + Ву ** N 
POLY (X, Y, M, М) = X ** M+ Y ** N 
A variable which appears in the expression but is not defined as an 
argument is called the parameter of the function. Values of such variablea 
should be defined before using the function. 
The function can be used in any subsequent lines of the program by 
writing the name of the function with actual arguments, like 


CIRCLE AREA(X) 
FVALUE = VALUE (AMOU INTEREST) 
POLYl = POLY (A, 2, 2) 


RING = AREA(X1) AREA (Х2) 


Note that the functions can be used on the right side like any other 
variables. The actual arguments may be variables or constants (or even 
expressions). However, they must agree in number order and type with 
the dummy arguments in the function definition statement. 

A statement function may use other statement functions if they are 
defined before it. Like function subprograms, the statement functions 
must be declared for their type in the program and defined after all 
declarations, but before the first executable statement. 


8291 INTRINSIC FUNCTIONS 


In numerical computing, we use mathematical functions like logarithm, 
square root, absolute value, sine, etc., very frequently. FORTRAN supports 
a library of such functions which can be invoked in our programs. Since 
these functions are part of FORTRAN, they are also called intrinsic or 
built-in functions. An intrinsic function can be invoked by simply typing 
the name of the function followed by the arguments enclosed in 
parentheses. Example: 
ABS (X) COS (THETA) SQRT(X * X + Y + Y) 

The most commonly used intrinsic mathematical functions are 
summarised in Table 5.5. When using any of these functions, it is a good 
practice to declare them using the INTRINSIC statement in the 
declaration section. 
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Table 5.5 Commonly used mathematical functions 


Function Description 
ШЕТУ ^ — Absolute value 

ACOS (x) Arccosine 

(result in radians) 
ASIN (x! Arcsine 

(result in radians) 
ATAN (x) Arctangent 

(result in radians) 
ATAN2(x,, Xl Arctangent of xix 

(result in radians) 
COS (x) Cosine 

(x in radians) 

Hyperbolic cosine 

Conversion to double precision real 

Power of e 


‘Truncation to integer 
Natural logarithm 


tbase e) 
LOGIO (x) Common logarithm 
(base 10) 
MAKIX;, Xy 00.) Maximum value 
MINIX; Xp ...) Minimum value 
MODix,, x) Remainder of division Xy Xg 
(e.g MOD(G;3) is 2) 
NINT x) Conversion to nearest integer 
REAL (x) Conversion to single-precision real 
SIM(x) Sine 
(а in radians) 
SIN (x) Hyperbolic sine 
SORT tx) Square root 
+200 
TANGO Tangent 
(x in radians) 
|“ тмн: Hyperbolic tangent. d 


gt Ie ave coinmen, Thi ломе, o program must t sted 
у errors before it is used. Errors in computer code are called bugs 
ess of correcting them is called dehy E. 
program mor he grammatically етот free but may produ 
Sc actimes, а program may produce correct results for 
data «nd wrong results fur another Set. Such errors are due t пргорег 
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program logic and are therefore known as logic errors or run-time errors. 
It is a good practice to test the program for all possible range and 
combination of data. 

Documentation is a most important but often neglected activity by the 
programmers. Documentation provides all details about the program 
intent, its variables and other requirements that allow the users to 
immediately understand and implement the program more easily. 

Documentation includes two parts — jnternal documentation and ex- 
ternal documentation. Internal documentation means the use of explan- 
atory remarks throughout. the program, which describe how various parts 
of the program work. This is very important from the maintenance point 
of view. 

External documentation includes instructions to the users on how to 
implement the program and what actions should be taken in certain 
special circumstances. Such a document is called user manual. 


EOS SUMMARY 


We presented an overview of FORTRAN 77 in this Chapter. We discussed 
briefly the following features that are frequently used in developing 
numerical computing software: 
e various categories of data types used to represent numbers and 
characters 
rules of defining variable names and creating storage space for 
them 
o creation and use of subscripted variables to represent tables of data 
© input/output statements required to read data values and print 
results 
o operators used for evaluating mathematical and logical expressions 
control structures supported by FORTRAN 77 
design and use of subprograms in building а large application 
program 
We have also highlighted the FORTRAN 90 features wherever applicable. 


Algebraic expression Logic errors 
Arguments " Logical constants 
Arithmetic operators Logical expression 
Array Logical operators 
Assignment statement Logical type 

Backward jump Looping structure 
Bugs Main program 

Built-in functions Mixed-mode ex; 'ession 
Calling program Modular programming 
Calling statement. Modules 


€ | 
(Contd.) 
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(Contd.) 
Character constants Multidimensional array 
Character type Named constant 
Comment line Nesting 
Complex constants Non-executable statement 
Complex type One-dimensional array 
Conditional execution Operators 
Constants Outer loop 
Control statement Output statement 
Control variable Parameter statement 
Counter Positional form 
Debugging Precedence rule 
Dimension Processing block 
DO loop Program statement 
DO..WHILE structure Program. unit header 
Documentation Real constants 
Double precision Real type 
Driver program Relational operators 
Dummy variables Run-time errors 
Executable statement Size 
Exponential form ‘Statement functions 
Expressions Sialement label 
Flow chart Statement number 
Format-directed I/O statement Subprogram 
FORTRAN expression Subroutine 
Forward jump Subscripted variable 
Functions Subscripts 
IF..ELSE structure Symbolic constant 
Initialisation Testing 
Inner loop Two-dimensional array. 
Input statement Type declaration 
Integer constants User manual 
Integer type Variable declaration 
Intrinsic functions Variables 
List-directed I/O statement 

1. What are FORTRAN constants? 

2. What are logical constants? Where are they used? 

3. When do we use the exponential form to represent real numbers? 
4. What are variables? State the rules of naming variables? 

5. What is an array? When do we use arrays in computing? 

6. What is meant by declaration of variables? How are array variables 


declared in FORTRAN? 
7. Describe the actions of the following statements: 
(a) READ * 
(b) PRINT * 
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(c) READ (*, 100) 
(4) WRITE (*, 200) 
(e) WRITE (6, 200) 
(© WRITE (*,*) 
What is the function of a FORMAT statement? Give an example. 


. How are DATA statements used to provide values to variables? 
. State the hierarchy of operations followed by FORTRAN in 


evaluating expressions. 


. List FORTRAN statements that are used to implement conditional 


execution statements. How are they different in terms of 
implementation? 


. Give two examples of each of the following expressions; 


(a), Relational expression 

(b) Logical expression 

(c) Mixed -made expression 
Is there any special caution to he exercised in writing these 
expressions? Explain. 


13. Why should we avoid the use of real variables as DO loop 


parameters? 


‚ What is nesting? When do we need to use nesting in numerical 


computing? 

What are subprograms? How are they used in program development? 
Distinguish between the function subprogram and subroutine 
subprogram. 


. What is a statement function? How is it different from function 


subprogram? 


. Give at least two examples of using statement functions in numer- 


ical computing? 


. What is testing? How is it different from debugging? 
. Describe the importance of documentation for programmers and 


program users. 


. Which of the following are illegal FORTRAN names? Why? 


(a) TOTAL (b) PART - 1 

(с) % MARK (4) REAL 

(е) A+ () 3M 

(g) X23 thi X AXIS 
Classify each of the ці sein “stunts as an integer constent or a 
real constant. È 5 ate the reasons 

(a) 123 te) ‘123! 

(d 12 +83 (f) +12 

(в) 12/ ti) FOUR 

(| -1.5602 


im) E5 


» 10) 25.5- 
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. Which of the following are legal character constants? 


(а) ‘A’ (b) TOTAL’ 
(с) *1.23" (4) ТОТА MARKS’ 
(e) ‘NEWTON'S LAW’ (Ð 'A.B.RAM" 
. Write the FORTRAN expressions for the following: 
ay ИРИ 1 atb Ж 
(а) cry (с) — OB -e)axd) 
(b) ax*y + bxy* + с (d) (2h 
y 


. Find the values of M and А when each of the following arithmetic 
statements is executed. 
(а) A225 30 ** 2/3.0 
(b) A = 2,0 ** 2 + 3.0 ** 2 — 4.0 * 3.0/2.0 
(с) A = 16/2 ** 3 + 5/2 * (2 * (6 — 4) 
(d) М = (9/4)⁄3/2) 
(e) M=4 ** 2 * (2/3) 
Identify errors in the following assignment statements: 
(a) X = SUM/N 
(b) А = COS(X) + FLOAT(N) 
(с) № = QUY) * (Х/2) 
(d) М-1 = (А + В + C/D 
(е) W = X** -2 + SQRT(N) 
(f) D =P * ALOG (-3.5) 
. Following statements contain mixed mode expressions. Correct them 
using 
(a) the type declaration statements 
(b) the type conversion functions 
(c) none of the above 
(1) AREA = LENGTH * WIDTH 
(2) FORCE - MASS * ACCEL 
(3) DIST = SQRTIN ** 2) 
. Using library functions construct FORTRAN statements for the 
following: 


(a) А = /s(s - a) (s - b) (s — c) 
(b) c= Ja? +b? – 2abcos (x) 


(e) за] 
X * y| 


(x+y)? 

NEC 

. Given below are three sets of expressions. The two expressions in 
each set, though appear to be identical, do not produce the same 
results for certain values of integer variables Z, J and К and the 


el 
(d) fesse 
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10. 


12. 


м. 


15. 


real variable X. Identify those values for which the two expressions 
are not equal. 
(a) (+ J)/ K and I/K + JIK 
(b) I*G/K) and I *J/K 
(с) X*IJ — and X* (4) 
Write a program to read two values from the keyboard and to 
display their sum along with their values on the screen as Tollows: 
(a) All the three values in one line 
(b) Values one below the other in separate lines 


. Given the lengths of the two sides of a right triangle, write a 


program to do the following: 
(a) To read the values of two sides from the keyboard 
(b) To calculate the area of the triangle (one-half the product of 
two sides) 
(c) To calculate the length of hypotenuse (square root of the 
sums of squares of the sides) and 
(d) To print the results with labels like 
Area = 
Hypotenuse = 
Write a program to evaluate the expression 


(x )* - 2: 2 
y tX 20у у 


zi 
and print the results for the following values of x and y: 
ta) x = 0.05 and у =900 ` 
(b) x = 0.005 and y = 900 
(c) x = 0.002 and y = 900 
Are the results different? 
Note that the above expression, on simplification, reduces to w = 1. 
If the results are different, why? 


. Declare the variables to be double precision in the above program 


and see the results. Is there a difference? If so, why? 
Write a program which requests the user to type in a number, If 
the number is four digit long or longer, then the computer should 
provide a message that the number is too large. If it is two digit 
long or shorter, then the message should be that the number is too 
small. Otherwise, print a message 
WELL DONE, WE THINK ALIKE 
Write a program to solve the quadratie equation 
ах? + bx «c2 0 
by using the formula 
b+ o? - 4ac 

2a 
The program should display the real roots or message indicating 
that there are no real roots (if b^ — 4ac is negative). 


roots = 


16. 


17. 


18. 


19. 


21. 


22; 
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Write a program to read the marks obtained by 25 students in a 
class and count the number of students with marks in the following 


е: 
(а) 0 to 39. (Fail) 
(b) 40 to 59 (Pass) 
(c) 60 and above (Pass with I class) 
Write programs to evaluate the following functions to 0.0001 per 
cent accuracy. 


x? х5 x! 


e 
(b) cos x mittant 


Write a program to read a set of numbers, count them, and find 
and print the largest and smallest numbers in the list and their 
positions in the list. 
Write a program to calculate and print the mean, variance, and 
standard deviation of a set of N numbers. 
( N 
—| Lx 
Nii N P 


Standard Deviation = / Variance 
Write a program to find the largest element of a given matrix and 
print out the value with location details. 
Write a subprogram to evaluate the factorial of a number which is 
given by 


x 
Mean = Ух; 


isl 


x 
Variance = 2. zx 1 


n!zn(n-1)(n-2)...1 
Using this subprogram write a main program to calculate the bino- 
mial coefficient 
EN: 
b= Gal 
This gives the number of combinations of n objects take г at a time. 


Write a subroutine subprogram that will interchange the values of 
two variables when called. 


. Write a menu-driven program that allows the user to use one of the 


following options: 
(a) To convert miles to kilometres 
(b) To convert feet to metres 
(c) To convert degrees Fahrenheit to degree Celsius 
Note: 1 mile = 1.60935 kilometres 
1 foot = 0.3048 metres 
C = 5/9 (F - 32) 
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24. Rewrite the following program so that it will take minimum time 
for execution. 


111 
222 


READ (5, 111) W, Cl, C2, С, R., V 

CRUL e Wo * OL * (1.0 +S * Rp ee 

СА s'w * (62 «DL * [1.D +O" Rb * V 
САР = C2 + Ci * (1.0 + G * R) 

WRITE (6, 222) CRTl, CRT2, CAP 

FORMAT [6F10.2) 

FORMAT (1Hb, 2F10.2, 5X, F10.5) 

STOP 

END 


25. Improve the following program segments: 


1. 


READ (5, 11) X, Y 

DO 50 І = 1, 100 

АІ IX * X. Y * VI/A 
WRITR (6, 22) A 

FORMAT (2F5.2) 

FORMAT (1Hb, F10.5) 
STOP 

END 


Xi - X « Y/B * C 


Roots of Nonlinear 
Equations 
o — M 


Ez INTRODUCTION 


Mathematical models for a wide variety of problems in science and engi- 
neering can be formulated into equations of the form 


Кх) –0 (6.1) 


where х and Дх) may be real, complex, or vector quantities. The solution 
Process often involves finding the values of x that would satisfy the 
Eq. (6,1). These values are called the roots of the equation. Since the 
function f(x) becomes zero at these values, they are also known as the 
zeros of the function ftx). 

Equation (6.1) may belon, 

1. Algebraic Siuatong 

2. Polynomial equations L y 

3. Transcendental equations „7 

(Any function of one variable which does not graph as a straight line 
in two dimensions, or any function of two variables which does not 
graph as a plane in three dimensions, can be said to be nonlinear.) 
Consider the function 

у = х) 


Ax) is а linear function, if the dependent variable у changes in direct 
proportion to the change in independent T x. For example 


to one of the following types of equations: 
MEE 


|y28:45 
is a linear function. Eam 

On the other hand, f(x) is said to be nonlinear, if the response of ће 
dependent variable y is not in direct or exact proportion to the changes 
in the independent variable x. For example 4 
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у=? +1 


is a nonlinear function. 
There are many situations in science and engineering where the 
relationship between variables is nonlinear. 


\UAtgebraic Equations 
An equation of type у = fix) is said to be algebraic if it can be expressed 
in the form 
Frnt aset t fiie fon 0 (6.2) 


where f, is an ith order polynomial in x. Equation (6.2) can be thought of 
as having a general form 
Кх, y)=0 (6.3) 


This implies that Eq. (6.3) portrays a dependence between the variables 
x and y. Some examples are: 


1. Зх + бу – 21-0 (linear) 
2. 2x + àxy – 25 = 0 (non-linear) 
3. х? -xy- Зу? = 0 (non-linear) 


‘These equations have an infinite number of pairs of values of x and y 
which satisfy them. 


Polynomial Equations 
Polynomial equations are a simple class of algebraic equations that are 
represented as follows: 
a,x" +a, x"! +... +a, x+a9=0 ) (6.4) 

This is called пі? degree polynomial and has л roots, The roots may be 

1. real and different - 

2. real and repeated 

3. complex numbers 
Since complex roots appear in pairs, if п is odd, then the polynomial has 
at least one real root. For example, a cubic equation of the type 


age + ax" + ax +ау=0 
will have at least one real root and the remaining two may be real or 
complex roots. Some specific examples of polynomial equations are: 
1 бх®-х*%+3х®=0 
2. х9-4х'+х+6=0 
3. x!- Ax «4-0 


X /Transcendental Equations 


СА non-algebraic equation is called a transcendental equation.) These. 
include trigonometric, exponential and logarithmic functions. Examples 
of transcendental equation are: è 
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> 
L 2вшх-х=0 / 
2.727 sinx- 1/2 x= 0 


3. log x= x= 1=0 
\ 42-0 


A transtendental equation may have a finite or an infinite number of 
real roots or may not have real root at all. 


Eza METHODS OF SOLUTION 


There are a number of ways to find the roots of nonlinear equations such 
as those described in Section 6.1. They include: 

У Direct analytical methods 

‘2 Graphical methods 

Trial and error methods 

^. Iterative methods F 
In certain cases, roots can be found by using direct analytical methods, 
For example, consider a quadratic equation such as 


ax? + bert c=0 (6.5) 
We know that the solution of this equation is 
bs bi. 


(6.6) 


v qu 


2a 
Equation (6.6) gives the two roots of equation (6.5). However, there are 


equations that cannot be solved by analytical methods. For example, the 
simple transcendental equation 


2sinx-x=0 


cannot be solved analytically. Direct methods for solving non-linear 
equations do not exist except for certain simple cases, 

Graphical methods are useful when we are satisfied with approxi- 
mate solution for a problem. This method involves plotting the given. 
function and determining the points where it crosses the x-axis. These 
points represent approximate values of the roots of the function. 

Another approach to obtain approximate solution is the trial and error 
technique. This method involves a series of guesses for x, each time 
evaluating the function to see whether it is close to zero. The value of x 
that causes the function value closer to zero is one of the approximate 
roots of the equation. 

Although graphical and trial and error methods provide satisfactory 
approximations for many problem situations, they become cumbersome 
and time consuming. Moreover, the accuracy of the results are inadequate 
for the requirements of many engineering and scientific problems. With 
the advent of computers, algorithmic approaches known as iterative 
methods have become popular. An iterative technique usually begins 
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with an approximate value of the root, known as the initial guess, which 
is then successively corrected iteration by iteration. The process of 
iteration stops when the desired level of accuracy is obtained. Since 
iterative methods involve a large number of iterations and arithmetic 
operations to reach a solution, the use of computers has become inevitable 
to make the task simple and efficient. 

In this chapter, we shall discuss а few iterative methods of solution 
that are commonly used. These methods are designed to determine the 
value of a single real root, using some initial guess values. Later in the 
chapter, we shall also discuss methods to determine all the roots of a 
polynomial. Finally, we shall discuss the solution of a system of non- 
linear equations. 


E ITERATIVE METHODS 


‘There are a number of iterative methods that have been tried and used 
successfully in various problem situations. All these methods typically 
generate a sequence of estimates of the solution which is expected to 
converge to the true solution. As mentioned carlier (all iterative methods 
bogin their process of solution with one or more guesses at the solution 
being sought. Iterative methods, based on the number of guesses they 
use, can be grouped into two categories: 

1. Bracketing methods 

2, Open end methods е 

Bracketing methods (also known as interpolation methods) start with 
two initial guesses that ‘bracket’ the root and then systematically reduce 
the width of the bracket until the solution is reached. Two popular 
methods under this category are: 
«`T. Bisection method 

"v. False position method > 
These methods are based on the assumption that the function changes 
sign in the vicinity of a root. 

Open end methods (also known as extrapolation methods) use a single 
starting value or two values that do not necessarily bracket the root. 
The following iterative methods fall under this category: 


ле method 
Secant method 


3. Muller's method 
4, Fixed-point method * 
‘4 Bairstow's method 
It may be noted that the bracketing methods require to find sign changes 
in the function during every iteration. Open end methods do not require 
this. 
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С STARTING AND STOPPING AN ITERATIVE 
PROCESS 

Starting the Process 

Before an iterative process is initiated, we have to determine either an 

approximate value of root or a “search” interval that contains a root. 

One simple method of guessing starting points is to plot the curve of, fix) 

and to identify a search interval near the root of interest, Graphical 

representation of a function cannot only provide us rough estimates of 

the roots, but also help us in understanding the properties of the function, 

thereby identifying possible problems in numerical computing. A plot of 

fx) =х$-х-1 

is shown in Fig. 6.1. Although Лх) is a cubic function, it intersects the 

x-axis at only one point. This suggests that the remaining two roots are 

imaginary ones. 


Fig. 6.1 Plot of {x)= -x-1 


In the case of polynomials, many theoretical relationships between roots 

and coefficients are available. A few relations that might be useful for 

making initial guesses are described here. 

paest Possible Root For a polynomial represented by 

Fi fix) =а, 1 «a, x" t... ag x + ag (6.7) 
the largest possible root is given by 


а„-\ 


a; =- (6.8) 


a, 
This value is taken as the initial approximation when no other value is 
suggested by the knowledge of the problem at hand. 
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Search Bracket Another relationship that might be useful for deter- 
mining the search intervals that contain the real roots of a polynomial is 


" z vo lk 
bs =) e) (69) 
a, a, 
where x is the root of the polynomial. Then, the maximum absolute 
value of the root is 


2 
а,.1 an-2 
Ew (6.10) 
This means that no root exceeds x,,,, in absolute magnitude and thus, 
all real roots lie within the interval ( —|х' ||, |). 

There is yet another relationship that suggests an interval for roots. 
All real roots x satisfy the inequality 


1 
lets te max feol lail.-..la, il] (6.11) 
a 


where the “max” denotes the maximum of the absolute values |а|, 
Vilis lanal- 


Sider the polynomial equation 
ох? BP 4 2x 12-70 


1 : 
Estimate the possible initial Büess Vilues 


That is, no root can be larger than the value 4. 
All roots must satisfy the relation 


T 


eed -(2)- Jia 
Therefore, all real roots lie in the interval (- Ла, 14). Visas use 


these two points as initial guesses for the bracketing methods and one of 
them for the open end methods. - Е 
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Stopping Criterion 
An iterative process must be terminated at some stage. When ? We must 
have an objective criterion for deciding when to stop the process. We 
may use one (or combination) of the following tests, depending on the 
behaviour of the function, to terminate the process: 


1. [xi - xi| < E, (absolute error in x) 
bcp] 

2. 122—4 «E, (relative error in x), x 20 

ө | 

3. Mrl < E (value of function at root) 

4. [fixi - ДХ) < E (difference in function values) 

5. [fol < Fs, (large function value) 

6. Ixl < XL (large value of x) 


Here, x, represents the estimate of the root at ith iteration and F(x) is 
the value of the function at s 

There may be situations where these tests may fail when used alone. 
Sometimes even a combination of two tests may fail. A practical conver- 
gence test should use a combination. of these tests. In cases where we do 
not know whether the process converges or not, we must have a limit on 
the number of iterations, like 


Iterations > N (limit on iterations). 


RA EVALUATION OF POLYNOMIALS 


All iterative methods require the evaluation of functions for which solu- 
tion is sought. Since it is a recurring task, the design of an efficient 
algorithm for evaluating the function assumes a greater importance. 
While it is not possible to propose a general algorithm for evaluating 
transcendental functions, it is quite simple to design an algorithm for 
evaluating polynomials, 

The polynomial is a sum of n+1 terms and can be expressed as 


f(x) Xa, xi =ag+ Fa, x (6.12) 
£6 


x а 
This can be easily implemented using а DO loop in FORTRAN. This 
would require n(n + 12 multiplications and л additions. 
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Сиш  _ 


Write a FORTRAN program segment to implement Ед. (6.12). 


SUM = АО 
po 100 1 = 1, N 
SUM - SUM + A(I) * X ** I 
100 CONTINUE 


Let us consider the evaluation of a polynomial using Horner's Rule ав 
follows: 


fx) = (..((a, x + à, тх + a, gh +. +арх 4 ag) (6.13) 


Here, the innermost expression a,x + c, , is evaluated first. The result- 
ing value constitutes a multiplicand for the expression at the next level. 
The number of level equals n, the degree of polynomial. Note that this 
approach needs a total of n additions and n multiplications. 

Horner's rule, also known as nested multiplication, is implemented 
using Algorithm 6.1. The quantities p,, p, 1, ...‚ po are evaluated 
recursively. The final quantity po gives the value of the function flx). 


Horner's Rule 


Pn = 0n 
Prt = Pr& + Ona 


Pj= рах +0 


ру= pe + ау 
fix) = po = рух + do 
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B. _ 


Evaluate the polynomial 
f= - а? 4246 
using Horner’s rule at x = 2. 


n= 3, ay = 1, az = —4, a} = 


рз=аз=1 
pasix2-(-4)--2 
py=(-2)x24+1=-3 
po=(-3)x2+6=0 
#2) =0 


Program POLY 


Program POLY shows a FORTRAN program to evaluate a polynomial of 
degree л using Horner's rule. This program uses a subroutine HORNER 
to implement Horner's algorithm. 

It is an interactive program and, therefore, requests input for degree 
of polynomial (n), polynomial coefficients (a;) and value of x from the 
user at the time of execution. Output of a sample run of the program 
POLY is given at the end of the program: 
рац ра Hees dantes sti prete ылыы — * 
PROGRAM POLY 
* Main program 
Program POLY evaluates a polynomial of degree л at 


any point X using Horner's rule 


NIL 


* 
Functions invoked j 
* 
Subroutines used * 


N - Degree of polyni 
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EXTERNAL HORNER 
DIMENSION A(10) 
WRITE(*,*) ‘Input degree of polynomial, N' 
READ(*,*) N 
WRITE(*,*) 'Input polynomial coefficients ( A(0) 
to A(N)) 

DO 100 І = 1, Nel 

READ(*,*) A(T) 


100 CONTINUE 


* 


WRITE(*,*} "Input value of X (point of evaluation) 
READ(*,*) X 

Evaluating polynomial at X using Horner's method 
CALL HORNER: N,A,X,P ) 

Writing the result 


WRITE(*,*) 
WRITE(*,*) Р(Х) = ', P, at X = ux 
кузе аш 
STOP 
END 
SUBROUTINE HORNER( N,A,X,P ) 
Subroutine 
HORNER computes the value of a polynomial of order 
n at any given point x. 
Arguments 
Input 
N - Degree of polynomial 
^ - Polynomial coefficients (array of size N«1) 
X - Point of interest of evaluation 
Output 


P - Value of polynomial at X 


NIL 


tee non 
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REAL A, X, Р 
INTEGER N 
DIMENSION A(10) 
P = Л(М№1) 
DO 111 I N, 1, -1 
P = P*X + A(T) 
111 CONTINUE 
RETURN 
END 
--- End of Subroutine HORNER ------------- * 


Test Run Results 
Input degree of polynomial, М 
H 


Input polynomial coefticients (A(0) te A(N)) 
12 


2 ^" 


input value of X (point of evaluation) 
0.1 


Р(Х) = 12.7226600 at 


The polynomial used for evaluatie 


F Gx? + 5x + 12 
and the coefficients apeTepresented by A(1), А(2), A(3), and A44) 
instead of A(0), 11), ^12), and A(3) in the program. 


pHiseclion method is one of the simplest and most reliable of iterative 
rfethods for the solution of nonlinear equations. This method, also known 
as binary chopping or half interval method, relies on the fact that if fix) 
is real and continuous in the interval a < x < b, and fla) and fib) are of 


opposite signs, that is, 
fa) fib) <0 


then there is at least one real root in the interval between а and b. 
(There may be more than one root in the interval). 

Let x, = a and xy = Б. Let us also define another point xy to be the 
midpoint between @ and b. That is, 


(6.14) 


х= 
LS 


p 
Now, there exists the following three conditions: 
1. if fixo) = 0, we have a root at x. 
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2. if xq) fi) < 0, there is a root between xp and x}. + 207 

3. if flxpfix;) < 0, there is a root between x; and ху. 
It follows that by testing the sign of the function at midpoint, we can 
deduce which part of the interval contains the root. This is illustrated in 
Fig. 6.2. It shows that, since fix) and fix) are of opposite sign, a root lies 
between x, and xp. We can further divide this subinterval into two halves 
to locate a new subinterval containing the root. This process can be 
repeated until the interval containing the root is as small as we desire. 


Fig. 6.2 liuslration of bisection method 


the equation 7 = 
Fla- / | ael =- 


bisection method. 


Thus 


Therefore, we have both the roots in the interval (-6, 6). The table below 
gives the values of f(x) between -6 and 6 and shows that there is a root 
in the interval (-2, —1) and another in (5, 6). 


х 1-6 1-5 41-31-2110 | 1] 2| 8| 4 [5 [.6 
Кх)| 50 | 35 | 22 | 11.2 -10|-12|-14|-13|-10 |-8 | 2 5 
Let us take x, =-2 and x, = -1. 1 
Then * = 3 j vol 
-2-1 ^ 
———Ü€ XC 
2 


11-2) = 2 and Д-1.5) = -1.75 
Since /(-2) fl-1.5) < 0; the root must Бе in the interval (-2, -1.5). The 
next step begins. 
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75 fr 
Pme 
.5) are of opposite sign, the root lies in the inter- 
val (-1.75, -1.5). Another iteration begins. 
ху = -1/15, x, = -1.5 and ху = —1.625 


#,2-2 хо = -1.5 and x9 = 


fi-1.625)=-6.859 ` ^ 
Now, the root lies in the interval (-1.75, -1.625) 
хо = -1.6875 
11.6875) = –0.40 
Next au" 1051 1898, 
11.72) = -0616 
Мех z=- Leam = -1.735 А 
11.735) = -0.05 
Next xg = -1.7425 
f(-1,7425) = «0,0063. — 
"The root lies between –1.735 and -1.7425. Va 


Approximate root is -1.7416. © 
a 
An algorithm to achieve this is given in Algorithm 6.2. 


Bisection Method 


. Decide initial values for x, and x, and stopping criterion, E. 
.. Compute f, = f(x,) and (= f(x;). 
If, x f > 0, x, and x, do nol bracket any root and go to step 7; 
Otherwise continue. 
. Compute x, = (x, + %)/2 and compute f, = f(x) 
5. Иһ кж then 


Sel x, = X, 
else 1 


set X, = Xo 
set f, = f 
6. If absolute value of (х, — х,)/х is less than error E, then 
root = (x, + X2 
write the value of root 
go to step 7 


else 
go to step 4 
7. Stop. 
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Convergence of Bisection Method 


In the bisection method, we choose a midpoint x, in the interval between 
x, and x». Depending on the sign of functions fq), fix), and fixo), x, or 
% is set equal to хо such that the new interval contains the root. In 
either case, the interval containing the root is reduced by a factor of 2. 
The same procedure is repeated for the new interval, If the procedure is 
repeated л times, then the interval containing the root is reduced to the 
size 

X2-X) Ax 

2^ 2^ 

After n iterations, the root must lie within + Ax/2" of our estimate. This 
means that the error bound at n“ iteration is 


Ax 
- Е, = FH 
Similarly, 
Ах 
Enti (6.15) 


That is, the error decreases linearly with each step by a factor of 0.5. 
The bisection method is, therefore, linearly convergent. Since the 
convergence is slow to achieve a high degree of accuracy, а large number 
of iterations may be needed. However, the bisection algorithm is 
guaranteed to converge. i 


Program ВІЅЕС' 


This program finds & root of a nonlinear equation using the bisection 
method, BISECT uses a subroutine, BIM to find a root in a given interval 
and invokes a function subprogram, F(x) to evaluate the function at the 
estimated root, 

‘The subroutine subprogram BIM locates a root in the given interval 
ГА, B] using Algorithm 6.2. BIM applies the following criterion for 
terminating the process 


That is, the relative error in the Successive approximations must be less 
than a specified error limit. 

The function subprogram F(x) simply evaluates the function value at 
а given value of x and returns the result to the calling module. Note that 
by simply changing the function definition statement 


Fzx*x«x-2 
we can use the BISECT program to evaluate a root of any function. 
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Also note that the program prints out a message in case the specified 
interval does not bracket a root. 


* 


PROGRAM BISECT 


Main program t 
This program finds a root of a nonlinear equation 
using the bisection method 


* Functions invoked 


n 


Subroutines used 
BIM 


* 
* 

"So MM 
* Variables used bod 
х № Left endpoint of interval isd 
* B - Right endpoint of interval & 
* S - Status * 
*  ROOT - Final Solution ud 
* COUNT - Number of Iterations done i 
* Constants used * 
* EPS - Error bound Ы 


REAL A,B,ROOT,EPS,F 
INTEGER 5, COUNT 
EXTERNAL BIM,F 
PARAMETER (EPS=0 . 000001) 
WRITE(*, *) 
WRITE(*,*) ' SOLUTION BY BISECTION METHOD’ * 
WRITE(*,*) 
WRITE(*,*) 'Input starting values* 
READ(*,*) А,В 
CALL BIM(A,B,EPS, S, ROOT, COUNT) 
IF (S.Eg.0) THEN 
WRITE(*,*) 
WRITE(*,*) ‘Starting points do not bracket any root’ 
WRITE(*,*) ‘(Check whether they bracket EVEN roots)' 
WRITE(*,*) 
ELSE 
WRITE(*,*) 
WRITE(*,*) ‘Root = ', ROOT 
WRITE(*,*) 'F(Root) = ', F(ROOT) 
WRITE(*,*) 
WRITE(*,*) 'ITERATIONS - ', COUNT 
WRITE(*,*) 


* 
D 
* 
* 
* 
* 
* 
B 
* 
* 
B 
* 
^ 
* 
* 
* 


* 


Subroutine 


This subroutine finds a 
in the interval 


соро 


Input 


[A,B] 


root of nonlinear equation 
using the bisection method 


А - Left endpoint of interval 

B - Right endpoint of interval 

EPS - Error bound 
Output 

S - Status 

ROOT - Final Solution 

COUNT - Number of Iterations 


Local 


Variables 


X1,X2, X0, F0, F1, F2 


F,ABS 


REAL A,B,ROOT,EPS, F,X1,X2,X0, FO,F1,F2,ABS 


INTEGER S,COUNT 


EXTERNAL F 
INTRINSIC ABS 


Punction values at initial points 


Xl = 
X2 = 
F1 

F2 = 


A 
B 
F(A) 
F(B) 


Test if initial values bracket a SINGLE root 
IF(Fl*F2 
$-0 
RETURN 


ENDIF 


-Ст.0) 


ТНЕМ 


Bisect the interval and locate the root iteratively 


COUNT 
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111 X0 = (X1+x2)/2.0 
FO = F(X0) 
IF (FO .EQ. 0) THEN 
8-1 
ROOT = XO 
RETURN 
ЕМІР 


IF(F1*FO .LT.0) THEN 
х2 = x0 


ENDIF 


* Test for accuracy and repeat the process, if necessary 
ТЕ (АВБ((Х2-Х1) /X2) .LT. EPS) THEN 
5-1 
ROOP = (х1«Х2)/2.0 
RETURN 
ELSE 
COUNT = COUNT + 1 
GO TO 111 
ENDIF 


END 


REAL FUNCTION F(X) 


REAL X 

F = X*X+x-2 

RETURN 

END 
fol End of function F(X)----------------- * 
Test Results of BISECT 


The program was used to solve the equation 
х®+х-2=0 
using two sets of starting points: 
(0.0, 2.0) and (0.5, 2.0) 
First run 
SOLUTION BY BISECTION METHOD 
Input starting values 
0.0 2.0 
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Root = 1.0000000 


F(ROOT) = .0000000 

ITERATIONS = 1 

Stop - Program terminated. 
Second run 


SOLUTION BY BISECTION METHOD 
Input starting values 


0.5 2.0 

Root. = 9.999999Е-001 
F (ROOT) = -3.576279E-007 
ITERATIONS = 21 


top - Program terminated. 


FALSE POSITION METHOD 


Àh bisection method, the interval between x, and x; is divided into two 
еді ves, irrespective of location of the root. It may be possible that 
the root is closer to one end than the other as shown in Fig. 6.3. Note 
that the root is closer to x,f Let us join the points x, and х, by a straight 
line. The point of intersection of this line with the x axis (xy) gives an 
improved estimate of the root and is called the false position of the root. 
This point then replaces one of the initial guesses that has a function 
value of the same sign as /(хо). The process is repeated with the new 
values of x, and хо. Since this method uses the false position of the root 
repeatedly, it is called the false position method (or regula falsi in Latin). 
It is also called the linear interpolation method (because an approximate 
root is determined by linear interpolation). 


" < 
ш Go fe NX 
х= m d 
"d ^ 
| (25277 
Fig. 6.3 illustration of false position method 
False Position Formula 


А graphical depiction of the false position method is shown in Fig. 6.3. 
We know that equation of the line joining the points (ху, f(x))) and 
(ху, Ро) is given by 

Роа) Р) y-fGi) 


-x qx. v (616) 
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Since the line intersects the x-axis at x, when x = ху, у = 0, we have 


or 


Xo-%) = 


Гоа х1) v 
obi Теа) fe) баб 
This equation is known as the false p position formula. Note that x, is 


obtained by applying a correction to x;. 


False Position Algorithm 


Having calculated the first approximate to the root, the process is repeated 
for the new intorval, as done in the bisection method, using Algorithm 6.3. 


Then, we have 


False Position Method 


ШЕ... Бы. гаг. ШИЙ 
хә) fo) 
If fix) х fü) «0 


Let Xo = х1 -fGi) к 


set x; = ху 
otherwise 
Set x, = Xp 


A major difference between this algorithm and the bisection Shin is 
the way xp is computed. 


Use the false position method to find а root of the function 
fix) =x*-x-2=0 
in the range 1 <x <3 _ E 


Iteration 1 
Given x, = 1 and xy = 3 
Ра) =f) =-2 
Ёк) = А3) = 4 


х-Ху 


хуу Лор) 2—1 
агац =й fixg)—flx,) 
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3-1 


=1+2 = 1.6667 
ii 


Iteration 2 
fizo) f(xy) = f(.6667)f(1) = 1.7778 
Therefore, the root lies in the interval between x, and xz. Then, 
ху = Xp = 1.6667 
Роа) = (1.6667) = -0.8889 
Гоз) = АЗ) = 4 
3- 1.6667 
4+ 0.8889 


= 1.909 


Xp = 1.6667 + 0.8889 x 
= 
Iteration 3 
F(1,909) (1.6667) = +0.2345 
Root lies between хо = 1.909) and x,(=3) 
Therefore, 
ху = хо = 1.909 
PEE) 


= 1.909 + 0.2647 x 3.7353 ^ 


The estimated root after third iteration is 1.986. Remember that the 
interval contains a root x = 2. We can perform additional iterations to 
refine this estimate further. oi? 


Convergence of False Position Method 2 


The false position formula is based on the linear interpolation model. In 
the false position iteration, one of the starting points is fixed while the 
other moves towards the solution. Assume that the initial points brack- 
eting the solution are a and 6 and that a moves towards the solution and 
6 is fixed as illustrated in Fig. 6.4. 

Let x, = a and x, be the solution. 


Then, 
еү=х,-х| 
Ln -Is 
That is, 
@=з-җ 


It сап be shown that 


Ge - BFR) (6.18) 
FG 


where R is some point in the interval x, and b. This shows that the 
process of iteration converges linearly. 


еі =е, х 
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f 


Fig. 64 Convergence of false position method 


Program FALSE 


The program FALSE finds a root of a nonlinear equation using the false 
position method. The program uses a function subprogram F and a sub- 
routine, FAL to implement the method. 

The function evaluates the function at any given point and the sub- 
routine determines a root in a given interval using Algorithm 6.3. 

We can use the FALSE program to identify a root of any function by 


Т of a nonlinear equation * 
* Funerions ‘ 
* Subroutine * 
* FAL ё 

+ 
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WRITE(*,*) ‘Input starting values’ 
READ(*,*) A,B 
WRITE(*,*) 
WRITE(*,*) * SOLUTION BY FALSE POSITION METHOD’ 
WRITE(*,*) 
CALL FAL(A,B,EPS, S, ROOT, COUNT) 
IF(S.EQ.0) THEN 
WRITE(*,*) 'Starting points do not bracket any 
root' 
WRITE(*,*) 
ELSE 
WRITE(*,*) 
WRITE(*,*) 'Root - ', ROOT 
WRITE(*,*) 'F(ROOT) = ^, F(ROOT) 
WRITE(*,*) 'NO.OP ITERATIONS - ', COUNT 
WRITE(*,*) 
ENDIF 


End of main FALSE 


Subroutine 
FAL finds а root of a nonlinear equation 


А - Left-end point of interval 
B - Right-end point of interval 
EPS - Error bound 

Output 
S - Status of completion of task 
ROOT - Final solution 
COUNT - Number of iterations done 


Local Variables 
X0,X1,X2,F0,F1,F2 


Functions invoked 
F,ABS 


Subroutines called 
NIL 
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REAL A,B,EPS,X0,X1,X2,F0,Fl,F2,F,ABS 
INTEGER S,COUNT 

INTRINSIC ABS" 

EXTERNAL F 


Xl = A 
х2 в 
Fl = Р(Х!) 
F2 = F(X2) 


* Test if A and B bracket a root 
IF(Fl*F2 .GT.0) THEN 
5 = 0 
RETURN 
ENDIF 
WRITE(*,*)" x1 x2 


COUNT = 1 
111 X0 = Xl - Fl * (X2-X1)/(F2-FL) 
FO - Р(Х0) 
IF(Fl*FÜ .LT.0) THEN 
X2 = KO 
F2 = FO 


ENDIF 
WRITE(*,*) X1,X2 


* Test if desired accuracy is achieved 


IF(ABS((X2-X1)/X2) .LT.EPS) THEN 
5 = 1 
ROOT - (Х1+Х2) *0.5 
RETURN 
ELSE 
COUNT = COUNT+1 
бо TO 111 
ENDIF 


ae киерин рсе * 


* Function subprogram Р(Х) 


REAL FUNCTION Р(Х) 
REAL X 


Fo. М*Х+Х-2 
RETURN 
END 
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------ 7------ Епа of function F(x} --=-----=--.-.-_ Ф 
Test Results of FALSE 
The program was used to find a root of the equation 
х2 +x-2=0 
using the initial values (1.5, 2.0) and 173.0, 0.0). Test results are given 


below: 
First run 


input starting values 

1.5 2.0 . 
SOLUTION BY FALSE POSITION METHOD 
Starting points do not bracket апу root 
Stop - Program terminated 


Second run 


Input starting values 


3.0 0.0 
SOLUTION BY FALSE POSITION METHOD 
х 
3.0000000 ~1. 0000000 
-3.0000000 1.6666670 
3.0000000 1.90909*0 
3.6000000n 
-3.000005 
3.000020 
3.0000000 t 
- es -).3929080 
-3.0000000 1.9999770 
з. 0000000 -1.99999 
~3. 0000009 1.9999990 
~3. 000000 2.0000000 
3.0000200 >. 065101 
3.0 

Root - -2. 6000000 

F(ROOT) - .0000000 

NC.OF ITERATIONS = 14 

Stop - Program termin. 


е thar the program орах 2 message when the given set of initial 
values do not bracket апу rose, When a root i- possible. the process ut 
iteration stops when the relative error satisfies the condition 


& 
a-i 


П 
is 
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tan =f) _ py) (6.19) 


uh 


where f(x,) is the slope of f(x) at x = 25. Solving for x, we obtain 


o fi) 
Wa Xi FG) (6.20) 
This is called the Neuton-Raphson formula. 
у = 
| if Hx) 
i | 
f 
i ‚© i 
UP DNE M 
0 = 
Fig. 6.5 Newton-Raphson method 
The next approximation would be 
à fixa) 
еее; 
In general, 
(6.21) 


This method of successive approximation is called the Newton-Raphson 
meth е process will be terminated when the difference between 
two successive values is within a prescribed limit. 

The Newton-Raphson method approximates the curve of f(x) by tan- 
gents. Complications will arise if the derivative f(x,) is zero. In such 
cases, a new initial value for x must be chosen to continue the procedure. 


146 Numerical Methods 


We can express f(r,,) using Taylor series expansion as follows: 


к) = fx, FG Fr ds 


1f we neglect the terms containing the second order and higher deriva- 
tives, we get 

fix, = fox) + Fh 
If x,,; is a root of fix), then 

fii) = 0 = fien) + А 


Then, 
EUM 
Po =хһ,1 Ха 
Therefore, 
az, f. 
Ха, ды: m f) " 
Newton-Raphson Algorithm 


Perhaps the most widely used of all methods for finding roots is the 
Newton-Raphson method. Algorithm 6.4 describes the steps for imple- 
menting Newton-Raphson method iteratively. 


Newton-Raphson Method, 


1. Assign an initial value to x, зау X; 
2, Evaluate f(x) and ffx) -~ 
3. Find the improved estimate of x; 


Kx) 

хо) 

. Check for accuracy of the latest estimate. 

Compare relative error to а predefined vane e | 
; Otherwise continue. Е 

X, by X, and repeat steps 3 and 4. 


x4-2Xo- 


Xi -xo 
xi 


sé 
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Find the root of the equation 
fe)2x'-3r42. V 
in the vicinity ofx = 0 using Newton-Raphson method. 
RENÉ шул а 
Let x, = 0 (first approximation) 


Similarly, 
5067 0.4444 395 
ху=0, MA. = 0.9333 
К... 6667 1.6667 0.9333 
0.071 
= 0.9333 -0:071 _ 9. 
„х1 = 0.9333 1334 0.9959 


0.0041 
x; =0,9959 00041 _ 0.0999 
959 -T0082 


0.0001 
х6 = 0. - — = 000 
хе = 0.9999 - "ЧИ. = 1.0006 


Since /(1.0) = 0, the root closer to the point х = 0 is 1.0007 


Convergence of Newton-Raphson Method 


Let x, be an estimate of a root of the function fix). If x, and x,,, are close 
to each other, then, using Taylor's series expansion, we can state 


R) 


йы) = flag) + {Чы бы а) + ey (629) 


2 


where R lies somewhere in the interval x, to х. and third and higher 
order have been dropped. 

Let us assume that the exact root of f(x) is x,. Then x,,, = x, There- 
fore f(x, ,,) = 0 and substituting these values in equation (6.22), we get 


[rà 


0 =/(х„) + Р Хх, x.) + z € ay (6.23) 
We know that the Newton's iterative formula is given by 
22 fe) 
ху “Fn f) 
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Rearranging the terms, we get 
f(x) = fq) On — Хон) 
Substituting this for flx,,) in Eq. (6.23) yields 
[rim 
zu 
We know that the error in the estimate x,,, is given by 
Enel 7 Xr Enl 


0 = (Xx, — хь.) + (x,- x, (6.24) 


Similarly, 
DEP 
Now, equation (6.24) can be expressed in terms of these errors as 
*(R) 
O= Га) esa + n 


Rearranging the Lerms we get, 


(6.25) 


ё 5 


Equation (6.25) shows that the error is roughly proportional to the square 
of the error in the previous iteration. Therefore, the Newton-Raphson 
method is said to have quadratic convergence. 


Program NEWTON 


The NEWTON shows а FORTRAN program for evaluating a root of non- 
linear equations by Newton-Raphson method. The program uses two 
external functions, F and FD and one intrinsic function, ABS. The func- 
tion F evaluates the actual function at a given value of x and FD evalu- 
ates the first derivative of the function at x. j 

"he program employs the Algorithm 6.4 and prints out the value of a 
root (when it is found) and the number of iterations required tn obtain 
the result. It also prints the value of the function at that point to check 
its accuracy. In case the process does not converge within a specified 
number of iterations, the program outputs a message accordingly. 


@ эы ater 


This program finds а root of a nonlinear equation 
y Newton-Raphson method 


* 
Main program Ф 


Functions invoked 


* NIL 
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* Variables used * 
й. ХО - Initial value of x * 
* XN - New value of x * 
* FX - Function value at x * 
* FDX - Value of function derivative at x * 
* COUNT - Number of iterations done * 
D * 
* Constants used * 
T EPS - Error bound * 


* MAXIT - Maximum number of iterations permitted + 
REAL X0,XN,FX,FDX,ABS,EPS,F,FD 
INTEGER COUNT, MAXIT 
INTRINSIC ABS 

EXTERNAL F,FD 

PARAMETER(EPS - 0.000001, MAXIT - 100) 
WRITE(*,*) "Input initial value of x 

READ(*,*) X0 

WRITE(*,*) 

WRITE(*,*) * SOLUTION BY NEWTON-RAPHSON METHOD’ 
WRITE(*, *) 


COUNT = 1 
100 FX = F(X0) 
FDX = FD(X0) 
XN = Х0 -FX/FDX 
IF(ABS((XN-X0)/XN) .LT.EPS) THEN 


WRITE(*,*) ‘Root = ', XN 
WRITE(*,*) ‘Function value = ', F(XN) 
WRITE(*,*) ‘Number of iterations = ', CCUNT 
WRITE(*,*) 

ELSE 
X0 = XN 


COUNT = COUNT + 1 
IF(COUNT.LE.MAXIT) THEN 
GO TO 100 
ELSE 
WRITE(*,*) 
WRITE(*,*) 'SOLUTION DOES NOT CONVERGE IN' 
WRITE(*,*) MAXIT, * ITERATIONS’ 
WRITE(*, *) 
ENDIF 


ENDIF 


REAL FUNCTION F(X) 
REAL X 


Р = X*X+X-2 


RETURN 
END 


End of function F(X) 


» d 


REAL FUNCTION FD(X) 


REAL X 

FD = 24X41 
RETURN 
END 


End of function ЕЮ(Х) ---------- * 


Test Results of NEWTON Given below are the outputs of the test runs of 
the program NEWTON. 


First run 
Input initial value of x 
в 


SOLUTION BY NEWTON-RAPHSON METHOD 
Root - 1.0000000 s 
Function value - .0000000 
Number of iterations 5 
Stop - Program terminated. 
Second run 
Input initial value of x 
-1.0 
SOLUTION BY NEWTON-RAPHSON METHOD 
Root - -2.0000000 
Function value = .0000000 
Number of iterations = 6 
Stop - Program terminated. 
Third run 
Input initial value of x 
1.0 
SCLUTION BY NEWTON-RAPHSON METHOD 
Root - 1.0000000 
Function value = .0000000 
Number oi iterations - 1 


Stop - Program terminated. 
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Show, through an example, that the number of correct digits approxi- 
mately doubles with each iteration in Newton-Raphson method. 

Given below is the output of NEWTON Program for solving the equation 
x? — 4x? + x + 6 = 0, using an initial estimate of 5.0. 


Iteration Estimation | Correct digits 
1 5.000000 NIL 
2 4.000000 NIL 
3 3.411765 3 
4 3.114462 ` 1 
5 3.013215 | 2 
6 3.000213 | 4 
1 3.000000 | 7 


This shows that the number of correct digits approximately doubles 
sOwith each iteration near the root. 


ations of Newton- son Method N N 


 Newton-Raphson method has certain limitations and pitfalls. The 
method will fail in the following situations. 
1. Division by zero may occur if '(х) is zero or very close to zero. 
2. If the initial guess is too far away from the required root, the 
process may converge to some other root, 
3. A particular value in the iteration sequence may repeat, resulting 
in an infinite loop. This occurs when the tangent to the curve f(x) 
at x = ху, cuts the x-axis again at x = x. 


SECANT METHOD 


nt method, like the false position and bisection methods, uses two 
initial estimates but does not require that they must bracket the root. 
For example, the secant method can use the points x, and х, in Fig. 6.6 
as starting values, although they do not bracket the root. Slope of the 
secant line passing through x, and х, is given by 


fix) fxs) 
Xp-Xg ху-®у 
Ро) (ху ~ x3) = Го) (x - ху) 
ог 
2g Йо) — fi] = fxg) x, — fay bry 
Then 


fign, - f) xs 


fG))- fi) 


(6.26) 


3 
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By adding and subtracting f(x;)x; to the numerator and rearranging the 
terms we get 


fx3)(z, -x1) 
„а = 27 
22 *2 а) Га) За 


Equation (6.27) is known as the secant formula (1f the secant line repre- 


sents the linear interpolation polynomial of the function f(x) (with the 
interpolating points x, and x.) then хз, which intercepts the x-axis, rep- 
resents the approximate root of f(x). 


хз = 


ао не line 


[ 1] 
Fig. 6.6 Graphical depiction of secant method 


The approximate value of the root can he refined by repeating this 
procedure by replacing x, and x, by х, and хз, respectively, in Eq. (6.27), 
That is, next approximate value is given by 


Nui чач) 
fixa) - f(xy) 


‘This procedure is continued till the desired level of accuracy is obtained. 
We can express the secant formula in general form as follows: 


Е O faper) 
sd Ает 
ЕУ (628), 


Note that Eqs (6.17) and (6.28) are simiter and both of them use two 
initial estimates. However, there is a major difference in their algo- 
rithms of implementation. In Eq. (6.17), the latest estimate replaces one 
of the end points of the interval such that the new interval brackets the 
root. But, in Eq. (6.28) the values are prefaced in strict sequence, i.e., 
x, 4 is replaced by x; and x, by x,,,. The points may not bracket the root. 


х4 7X3 
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Secant Algorithm 


Note that the value of new approximation of the root depends on the 
previous two approximations and corresponding functional values, Algo- 
rithm 6.5 illustrates how this procedure is implemented to estimate a 
root with a given level of accuracy. 


Secant Method 


Decide two initial points x, and х, accuracy level required, E. 
Compute f, = f(x,) and f = f(x) 


foxy =the, 


n> 


w 


Compute хз = 
2-h 
Test for accuracy of хз. 


a 


хз rel 


Li > E, then 


ху | 
59t X, = x, and f, =f 
set x, = X; and £, = f(x,) 
go to step 3 
otherwise, 
set root = x5 
print results 
5. Stop 


[отт] 


өле „secant method to estimate the root of the equation 
x*-4r-10-0 
al estimates of x, = 4 and x; = 2 


maim pe mmn WAS Io a e 


Гер) = f) = -10 
Р) = (2) = 14 
(Note that these points do not bracket a root) 


fixi) (хә ху) 


X; = - 
eta) = fle) 
PE gn от. EE 
MFI ey > =14+19 
For second iteration, jg 
=- BE 
х=х,=2 = 2 = 
m mous fes 
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%_=%g=9 
fœ) = f = -14 
Гоз) = f(9) = 95 
35(9 - 2) 
785 +14 


-i 


ху=9- 


For third iteration, 
m=9 
ху=4 

f(xy) = ft = 
Газ) = fi) = 


уай 


For fourth iteration, 


ху=4 
x)= 5.1111 
Га) = f = - 


Flay) = 5.1111) = ~ 4.3207 


ху =5.1111 


Vor fifth iteration, 


fip P 1111) = 4.3207 
fixa) = (5.9563) = 5.0331 


5.0331(5.9563 ~ 5.1111) 
ху = 5.9563 — Ossie eat 5,5014 
For sixth iteration, 
2 x, = 5.9563 
x, = 5.5014 
fo) = f(5.9539) = 5.0331 
f) = f(5.5014) = -1.7392 
å ~ 1.7392(5.5014 - 5.9563) 
75.5014- — 7399 1 5.0331 7392 : 5.0331 25, 6182 
The value can be further refined by continuing the process, if necessary. 
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Compare the secant iterative formula with the Newton formula for esti- 
mating a root. 


(f 1 
Newton formula: dui Se ЛТ 
filx,) 
CAT. е 
Secant formula: ыз =x, a a 


а) а) 


This shows that the derivative of the function in the Newton formula 
fa), has been replaced by the term 


fx,)- fix, 4) 


Xa Sy 


in the secant formula. This is a major advantage because there is no 
need for the evaluation of derivatives. There are many functions whose 
derivatives may be extremely difficult to evaluate. 

However, one drawback of the secant iterative formula is that the 
previous two iterates are required for estimating the new one. Another 
drawback of the secant method is its slower rate of convergence. It is 
proved later in this section that the rate of convergence of secant meth- 
od is 1.618 while that of the Newton method is 2. 


Convergence of Secant Method 
The secant formula of iteration is 


(6.29) 


Let x, be actual root of f(x) and е the error in the estimate of x. 
Then, 

ind е +2, 

x, =e; +x, 

Xia 64 tx, 
Substituting these in Eq. (6.29) and simplifying, we get the error equa- 
tion as 
ei Af (xi) ei f(x; 4) 

Ро) fiii) 

According to the Mean Value Theorem, there exists at least one point, 
say x = R; in the interval х; and x, such that 


(6.30) 


Sac 
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We know that 
fx)-0 
x,-X,- 6j 
and therefore 
"m 
е 
ог 


fix) = ef) 
Similarly, 
FG = ел P 
Substituting these in the numerator of Eq. (6.30), we get 


‚ое РОТ a) 
in 681 Fo firi) 
That is, we can say 
Cir FECA (6.31) 
J 


We know that the order of convergence of an iteration process is p, if 
eic eli (6.32) 


ej xo (6.83) 


or 


Substituting for e;,, and e; in Eq. (6.31), we get 


or 
eue Pn (6.34) 


Comparing the relations (6.32) and (6.31), we observe that 


p=(+l)p 
‘That is, 
p'-p-i-0 
which has the solutions 
B: 
AEN 
Since p is always positive, we have 
p- 1.618 


It follows that the order of convergence of the secant method is 1.618 
and the convergence is referred to as superlinear canvergence. 
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Program SECANT 


The program SECANT finds a root of a non-linear equation using two 
initial values supplied. The program employs a function subprogram, Е, 
to evaluate the value of the function and a subroutine subprogram, SEC, 
to implement the Algorithm 6.5 for estimating the root. The subroutine 
uses the absolute relative error in the successive approximations for 


* This program finds a root of a nonlinear * 
» equation by secant method * 


* Functions invoked 
* т 


Subroutines used 
* see id 


* Variables used 
* A - Left endpoint of interval * 
* B - Right endpoint of interval 
* ROOT - Final solution 

* COUNT - Number of iterations completed * 


ae ks Shea Ms pe ме 


* Constants used 
i EPS - Error bound 
*  MAXIT - Maximum number of iteration 


REAL A,B,ROOT,EPS,F 
INTEGER COUNT,STATUS, MAXIT 

EXTERNAL F,SEC 

PARAMETER(EPS = 0.000001, MAXIT = 50) 


WRITE(*, *) 

WRITE(*,*) ' SOLUTION BY SECANT METHOD‘ 
WRITE(*,*) 

WRITE(*,*) ‘Input two starting points’ 
READ(*,*) A,B 


CALL SEC(A,B,X1,X2,EPS,ROOT, COUNT, MAXIT, STATUS) 


IF( STATUS .EQ. 1 ) THEN 
WRITE(*,*) 
WRITE(*,*)' DIVISION BY ZERO' 
WRITE(*,*) 
WRITE(*,*)' Last x1 oP INI 
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WRITE(*,*)' Last X2 2 
WRITE(*,*)' ITERATIONS -',COUNT 
WRITE(*,*] 

ELSE IF( STATUS .EQ. 2 ) THEN 
WRITE (*,*) 
WRITE(*,*) ‘NO CONVERGENCE IN ',MAXIT,' ITERATIONS’ 
WRITE(*,*) 

ELSE 
WRITE(*,*) 
WRITE(*,*) ‘Root = ', ROOT 
WRITE(*,*) ‘Function value at root = *, Е (ROOT) 
WRITE(*,*) 
WRITE(*,*) ‘Number of iterations = ‘,COUNT 
WRITE(*,*) 

ENDIF 


SUBROUTINE SEC (A, B, X1,X2, EPS, ROOT, COUNT, MAXIT,STATUS) 


е» 


* Subroutine ы 
» This subroutine computes а root of an equation * 
* using the secant method . 
+ Argumer * 


+ MAXIT - Maximum iterations allowed a 
* Output * 
* х1 New left point 7 
€ X2 - New right poi " 
ы ROOT Final solution * 
* COUNT - Number of iterations done zi 
* STATUS - Status of completion of the task + 


* Local Variables 
ld X3,Fl,F2,ERROR L 


* Functions invoked ia 
* Р, ABS * 


ж Subroutines called s 
* NIL х 


Roots of Nonlinear Equations 159 


REAL A,B,Xi,X2,X3, EPS, ROOT, Fi, £2,#,ABS, ERROR 
INTEGER COUNT, STATUS, МА 
INTRINSIC ABS 
EXTERNAL F 
* Punction values at initial pei 
XL =A 
X2 = B 
Fl = P(A) 
F2 = F(B) 
* Compute the root iteratively 
COUNT = 1 
111 IF(ABS(F1-F2) .LE. 1.Е-10) THEN 
STATUS = 1 
RETURN 
ENDIF 
X3 - X2-F2 * (X2-X1)/(F2-F1) 
ERROR = ABS((X3-X2)/X3) 


* Test tor accuracy 
IF (ERROR .Gi'. EPS) THEN 
* =- Test for convergence 
TF( COUNT .EQ. MAXIT ) THEN 
STATUS = 2 


F2 © F(X3) 
COUNT = COUNT + 1 
бо TO 111 
* and compute next approximation 


ENDIF 


ROOT = X3 
SATUS = 3 


RETURN 


Rnd of subroutine SEC - 


* 


Brat аны аы а аш аана E A ЕБЕ 


REAL FUNCTION F(x} 
REAL Х 
F = X"X+x-2 
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RETURN 
END 
ж-------------- End of function F(X) ----- T * 
Test Results of SECANT 
Given below are the outputs of test runs of SECANT 
First run 


SOLUTION BY SECANT METHOD 
Input two starting points 
-3,0001 0 
Root = -2.0000000 
Function value at root = .0000000 
Number of iterations = 11 
Stop - Program terminated. 
Second run 
SOLUTION BY SECANT METHOD 
Input two starting points 


0: =3 
Root = -2.0000000 
Function value at root = .0000000 


Number of iterations - 8 
Stop - Program terminated. 


——_Stop_- Program terminates 
Note that the program incorporates a test for convergence and also a 
test for ‘division by zero’ while evaluating the secant formula (see Eq. 6.27). 


rr FIXED POINT METHOD 


Any function in the form of. 

f(x) =0 (6.35) 
can be manipulated such that x is on the left-hand side of the equation 
as shown below 

х = a(x) (6.36) 
Equations (6.35) and (6.36) are equivalent and, therefore, a root of 
Ед. (6.36) is also a root of Eq. (6.35). The root of equation (6.36) is given 
the point of intersection of the curves у = x and у = g(x). This intersec 
tion point is known as the fixed point of g(x) (see Fig. 6.7). 

The above transformation can be obtained either by algebraic manip- 
ulation of the given equation or by simply adding x to both sides of the 
equation. For example, 

x3 4x-2-0 
can be written as 
х=2-х 
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-—— 


х x—- 


Fig. 6.7 Fixed point method 


or 
х=х®+х-@+х=ах°* +2х-2 
Adding of x to both sides is normally done in situations where the origi- 
nal equation is not amenable to algebraic manipulations. For example, 
tanx=0 
would be put into the form of Eq. (6.36) by adding x to both sides. That is, 
x=tanx+x 
The equation 
х = glx) 
is known as the fixed point equation. It provides a convenient form for 
predicting the value of x as a function of x. If x, is the initial guess to a 
root, then the next approximation is given by 
x, = glx) 
Further approximation is given by 
Xp = go) 
This iteration process can be expressed in general form as 


х=) = 0,12... (6.37) 


which is called the fixed point iteration formula. This method of solution 
is also known as the method of successive approximations or method of 
direct substitution. 

The algorithm is simple. The iteration process would be terminated 
when two successive approximations agree within some specified error. 


a o 


Locate root of the equation 
x .x-2-0 
using the fixed point method. 
Тһе given equation can be expressed as Ў 
2a Q—2? 
Let us start with an initial value of x, = 0 


1322-022 f 
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х;=2-4=-2 

ху=2-4=-2 
Since x; — x, = 0, -2 is one of the roots of the equation. 
Let us assume that x, 


ag=2-1= 
Another root is 1. 


8H. NN 


Evaluate the square root of 5 using the equation 
gorx -5=0 
by applying the fixed point iteration algorithm. 


Let us reorganise the function as follows: 


Ae = Bix, 
and assume xp = 1. Then, 
ч x-b5* 
х=] 
ху=5 
*,21 


The process does not converge to the solution. This type of divergence is 
known as oscillatory divergence. 
Let us consider another form of g(x) as shown below: 
х=хї+х-6 


хз = 235 

ху = 55455 
Again it does not converge. Rather it diverges rapidly, This type of 
divergence is known as monotone divergence. 


Let us try a third form of gx). 


2х-5/х+х 
*5/x 
or x 
2 
Xo 
са 
хз = 2.3333 
х, = 2.2381 
ху = 2.2361 
хь = 2.2361 


This time, the process converges rapidly to the solution. The square root 
of б is 2.2361. 
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Convergence of Fixed Point Iteration 


As stated earlier, the iteration function g(x) can be formulated in differ- 
ent forms. Example 6.12 shows that not all forms result in convergence 
of solution. Convergence of the iteration process depends on the nature 
of g(x). Figure 6.8 illustrates various patterns of behaviour of the itera- 
tion process of the fixed point method. Figures 6.8(a) and 6.8(Ь), show 
that the solution converges to the fixed point x, during the iteration 
process. However, it does not happen in Fig. 6.Bc and 6.84. Notice that. 
the process converges only when the absolute value of the slope of y = 
gx) curve is less than the slope of y = x curve. Since the slope of y 
curve is 1, the necessary condition for convergence is 
giz) «1 

We can also notice that, in the neighbourhood of the solution, if the 
slope of g(x) is positive, the convergence is monotone with “staircase” 
behaviour, and if the slope of gix) is negative, the convergence is oscilla- 
tory in behaviour. It is also clear that the closer the slope of glx) is to 
zero, the faster will be the convergence of the process. 


900 


+ 


ee 0н В Жу ЖНГ 
T X ох XN K 


(c) Monotone divergence 


X X X x X 
(b) Spiral convergence (d) Spiral divergence 
Fig.68 Patterns of behaviour of fixed point iteration process 
We ean theoretically prove this as follows: 


The iteration formula is 
ELI (6.38) 
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Let x; be a root of the equation. Then, 


x; = glx) (6.39) 
Subtracting equation (6.38) from equation (6.39) yields 
Xp 14 = В(хр) - gx) (6.40) 


According to the mean value theorem, there is at least one point, say, 
x = R, in the interval ху and x, such that 


g(xy)- glx;) 
a 
‘This gives 
gp - gx) = #'(ЙХх- х) 
Substituting this in Eq. (6.40) yields 
xj-3,42 (К-х) (6.41) 
1f e, represents the error in the ith iteration, then Eq. (6.41) becomes 
ĉi +1 = gv, (6.42) 
This shows that the error will decrease with each iteration only if 
Е) <1 
Equation (6.42) implies the following: 
1. Error decreases if g'(R) « 1 
2. Error grows if g'(R) > 1 
З. ТЕДА) is positive, the convergence is monotonic as in Fig. 6.8(a) 
4. If g'(R) is negative, the convergence will be oscillatory as in 
Fig. 6.8(b) 
5. "The error is roughly proportional to (or less than) the error in the 
previous step; the fixed point method is, therefore, said to be /in- 
early convergent 


Program FIXEDP 


The program FIXEDP is the simplest of all programs discussed so far for 
determining a root of a nonlinear cquation. The iteration process is 
terminated when two successive approximations agree within some врес- 
ified error. The program uses a control loop to terminate the execution 
when the process does not converge within a specified number of itera- 
tions. 


PROGRAM FIXEDP 
Bane reer camer a iain 
* Main program 
* This program finds a root of a function using 
* the fixedp point iteration method 


P сео йрн EA —€— a 


te eee 


* 


Vai 


* Constants used 


Functions invoked 


Subroutines used 
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G, ABS 


riables used 

X0 - Initial guess 

X - Estimated root 
ERROR - Relative error 


«ж ажаа э © 


Ерб - Error bound 
MAXIT - Maximum Iterations allowed 


* 


REAL X0, X, ERROR, G, ABS, EPS 
INTEGER MAXIT 

INTRINSIC ABS 

EXTERNAL G 

PARAMETER (EPS - 0.00001) 


WRITE (*,* P 

WRITE (*,*) 'SOLÜTTON BY FIXED POINT ITERATION METHOD' 
WRITE(*,*) 

WRITE(*,*) "Input initial estimate of root' 

READ (*,*) X0 

WRITE(*,*) 'Maximum iterations allowed’ 

READ(*,*) MAXIT 

WRITE(*,*) 

WRITE(*,*)' ITERATION VALUE OF X ERROR’ 


DO 100 I = 1, MAXIT 


100 


х = G (X0) 
ERROR = ABS((X-X0)/X) 
WRITE(*,*) I,X,ERROR 
IF (ERROR .LT.EPS) THEN 
WRITE (*,*) 
STOP 
ENDIF . 
x0 - X 
CONTINUE 


WRITE (*,") ‘Process does not converge to а root’ 
write(*,*) ‘Exit from loop’ 


STOP 
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ДЕ НАНА 


* Function 


REAL FUNCTION G(X) 
REAL X 


G = 2.0-X*X 


RETURN 


*-----<----—---- End of function G(X) --------- 


Test Results of FIXEDP The outputs of the program FIXEDP for various 
initial values are shown below: 
First Run 

SOLUTION BY FIXED POINT ITERATION METHOD 

Input initial estimate of root 

9.0 

Maximum iterations allowed 

10 


ITERATION VALUE OF X ERROR 
i 2.0000000 1.0000000 
2 0000000 2.0000000 
3 0000900 * 0000000 
Stop Program terminated. 
Second Run 
SOLUTION BY FIXED POINT ITERATION METHOD 
Input initial estimate of root 


1 

Maximum iterations allowed 

10 
ERROR 

-0000000 


ITERATION 
3 
Stop - Program terminated. 


— E temmuated. L L mms 
[БИ DETERMINING ALL POSSIBLE ROOTS 


All the methods discussed so far estimate only one root. What if we are 
interested in locating all the roots in the given interval? One option is to 
plot graph of the function and then identify various independent inter- 
vals that bracket the roots. These intervals can be used to locate the 
various roots. 

Another approach is to use an incremental search technique covering 
the entire interval containing the roots. This means that search for a 
root continues even after the first root is found. The procedure consists 
of starting at one end of the interval, say, at point a, and then searching 
for a root at every incremental interval till the other end, say, point b, is 
reached (see Fig. 6.9). The end points of each "incremental interval" can 


ү) 
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Fig. 6.9 incremental search for ali possible roots 


serve as the initial points for one of the bracketing techniques discussed. 
Algorithm 6.6 describes the steps for implementing an incremental search 
technique using the bisection method for locating all roots. 

A major problem is to decide the increment size. A small size may 
mean more iterations and more execution time. If the size is large, then 
there is a possibility of missing the closely spaced roots 


No 


naon 


| Determining all roots | 


. Choose lower limit a and upper limit 6 of the interval covering all 


the roots. 
Decide the size of the incremental interval ^x 


, Set x = aand x= x, + х 
. Compute f, = х, and f = Ax) 


If f, x f > 0, the interval does not bracket any root 
go to step 9 

Otherwise, 
continue 


. Compute x, = (x, + X)/2 and f; = Axo) 


Mf f, x h < O, then 
set х= X 
else 
set x, = x, and f, = f 


. ИЦ» — хр < E, then 


root = (X, + x)/2 
write the value of root 
go to step 9 
else 
go to step 6 
if X, < b, then set a = x, and go to step 3 


. Stop 


Algorithm 6.6 | 
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[53 SYSTEMS OF NONLINEAR EQUATIONS 


А system of equations is a set consisting of more than one equation, A 
system of n equations in n unknown variables is given below. 


fiy хь ex) = 0 
fiy хь. Xn) = 0 
(6.43 


fiii xy s x.) = 0 


Equation (6.43) requires values for x,, ху, ..., x, such that they satisfy 
all the n equations simultaneously. If these equations can be expressed 
in the form 


Fix) = a,x, + agza +. жа, х,-с= 0 
then the system is said to be linear. On the other hand, if they involve 
variables with powers, then the system is said to be nonlinear. 
For example, 


х? + 2х —y =o 
2? + 3ху= 4 
is a system of nonlinear equations in two unknowns. These equations 
can be expressed in the form of equation (6.43) as 
f(x, y)=x" + 9x -y*- 2-20 (6.44) 
go у)= 2+ 8ху-4 =0 (6.45) 


Solution of these equations requires values of x and у that could satisly 
both of them simultaneously. We will discuss two methods in this section 
for solving such equations. 


Fixed Point Method 


One simple approach for solving a system of nonlinear equations is to 
use the fixed point iteration method. Equations (6.11) and (6.15) can be 
written in the form 

x = Fix, у) 

у = Gtx, y) 

We can compute x and y using some initial values of x and y on the 
right-hand side. The new values of x and y can again be used to compute 
the next set of x and y values. This process can be repeated till a desired 
level of accuracy in the computed values is reached. This iterative process 
can be represented in general form as 


җы = Foy) 
Ум = GG, уд (6.46) 
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This can be implemented using the steps given in Algorithm 6.7. 


Fixed point method for a system 


1. Detine iteration functions 
F(x, y) and G(x, y) 
2. Decide starting points x, and y, and error tolerance Е 
3, х, = Ехо, Yo) 
у = (хь Yo) 
4. If Ixy — Xj < E and 
ly, — yol < E, then 
solution obtained; 
go to step 6 
5. Otherwise, set 
х= 
у= 
до to step 3 
6. Write values of x, and y, 
7. Stop 


ess] 


method. 


=у+ 3 
aty 
6-х? 
у= 
x 
Assume xp = 1 and y = 1 
-25 
»-5 


уз = 13 
‘The process does not converge. We have to solve the system by forming 
another set of equations for x and y- 


120 Numerical Methods 


Following an approach similar to the one discussed in Section 6.10, it 
can be shown that the iteration process converges if the following 


equations are satisfied. 
дЕ) |96 
е 
| 
|“ 1 (6.47) 


дЕ | 
+ 
ži y 
The task of forming appropriate iterative functions Fix, y) and G(x, y) to 
satisfy the above conditions may become very difficult and, therefore, 


the fixed point iteration process is rarely used to solve systems of non- 
linear equations, 


Newton-Raphson Method 


The Newton-Raphson method, which was discussed in Section 6.8 for 
solving single nonlinear equations, can be extended to systems of non- 
linear equations. Recall that a first order Taylor series of the form 


ant 
26 


fi) = Го) + (бк, — x) РО) (6.48) 
was used to derive the Newton iteration formula 
‚з f(x) 
Me Tf uS (6.49) 


for solving one equation. For (he sake of simplicity, let us again consider 
a two-equation nonlinear system 


fix,y)=0 gmyso 
First order Taylor series of these equations can be written as 


FO sry 344) = Fey y) + (ay — x) | *ua-» | (6.50а) 
s as Уша) = ву) + G1, - x) ET uai | (6.50Ь) 


If the root estimates are x,,, and y,,,, then 
«ыл» Уы) = 85, Уа) = 0 
Substituting this in Eq. (6:50) we get the following two linear equations: 
Axf,+Ayfp+f=0 (6.51а) 
Ахду + Ayg; +Е = 0 (6.51b)) 


where we denote 
Ax Etpa 
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AY = Yen Yi 
fh ES fa E 
^ d 


fzfa,y) 8 = Еб y) 
Solving for x апа у, we get 


f:ga-8 fa | Dx , 
Ax =-———_——— = - = (6.52а) 

figa-fag D 

g:fı-f-g, __Dy 

T. иы ы ЗИВ 2А (6.52b) 

fhids-hg, D 

where 
fi 


fits - Bik 


is called the Jacobian matrix. From Eq. (6.522) and (6.52b), we can 
establish the following recurring relations: 


(6.53(а) 


(6.53) 


Equations (6.534) and (6.530) are similar to the single-equation New- 
ton formula and may be called the two-equation Newton formula. "These 
equations can be used iteratively and simultaneously to solve for the 
roots of f(x, y) and gíx, y). 

Algorithm 6.8 lists the steps involved in implementing the Newton 
iteration formula for a two-equation system. 


Two equation Newton-Raphson method 


1. Define the functions f and g 
2. Define the Jacobian elements 
f. for 9 and ga 
3. Decide starting points х, and y and error tolerance Е. 
4, Evaluate f, д, fy, for 91, Go 8! (Xo. Yo) 
Compute Dx, Dy and D 
ху = X% — DWD 
у = Yo - DyD 


(Contd.) 


5. Test for accuracy. 
Wo |x, х « Eand 


go 
7. Write results 
8. Stop 


Let Ех, у)= х2 +ху-6 
Сбх, у) = x-y -3 


DEL naey 


E] 


> 
v 


% 
i 

SIS 95 S18 & 
he 


Б 
" 


Assume the initial guesses 


Tteration 1 
Ah=3f,=1 
£72,855 7-2 
and therefore 
D=-6-2=-8 


‘The values of functions at x; and y; 
Р=1?+1х1-6=—4 
G-r-r-3 =-3 
(4)—2) - (-3) (1) 


x= Е = oe 
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C33) - (-4) (2) 
21-0875 
€ C8) 
Iteration 2 
fı = 2% 2.375 + 0.875 = 5.625 


2,375)" + (2.375) (0.875) — 6 = 1.71187 
G = (2.375)? — (0.875) = 4.8750 
D = (5.625) (71.75) — (4.75) (0.875) 
=- 9.8436 - 4.1563 =-14 
(17187) (- 1.75) - (4.875) (0.875) 
-14 


X, =2,375- 


=2.315- (8.0077) 743636 = 2.375 - 0.5195 


(4.875) (5.625) - (1.7187) (4.75) 
-14 
27.4218 - 8.1638 
a 2.2506 
Continue further to obtain correct answer. 


ROOTS OF POLYNOMIALS 


We have seen that the methods discussed so far can also be used for 
evaluation of the roots of polynomials. However, these methods run into 
problems when the polynomials contain multiple or complex roots. 
Polynomials are the most frequently used equations in science and 
engineering and, therefore, require special attention in terms of evaluation 
of their roots. In this section, we discuss methods to determine all real 
(not necessarily distinct) and complex roots of polynomials, These methods 
are specially designed for polynomials and, therefore, cannot be used for 
transcendental equations. 
We will try to use the following properties of nth degree polynomials: 
1. There are л roots (real or complex) 
2. A root may be repeated (multiple roots) 
3. Complex roots occur in conjugate pairs 
4. Tfn is odd and all the coefficients are real, then there is at least one 
real root 
Б. The polynomial can be expressed as 
р(х) = (x — x) q(x) 
where x, is a root of p(x) and g(x) is the quotient polynomial of 
ordern- 1 


уз =0.815- 
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The number of real roots can be obtained using Descartes’ rule of sign. 
This rule states that 
1. The number of positive real roots is equal (or less than by an even 
integer) to the number of sign changes in the coefficients of the 
equation 
2, The number of negative real roots is equal (or less than by an even 
integer) to the number of sign changes in the coefficients, if x is 
replaced by ~x 


Multiple Roots 


A polynomial function contains a multiple root at 2 point when the 
function is tangential to the x-axis at that point. For example, the equation 
xn + Mix -9=0 
has a double root at x = 3 (see Fig. 6.10(a)), The graph is tangent to the 
x-axis at this point. Similarly, the equation 
х*- 10x" + 3622 — 56x + 32 = 0 

has a triple root at x = 2 (see Fig. 10(b)), Note that the curve crosses the 
x-axis for odd multiple roots and turns back for the even multiple roots 
‘This means that the bracketing methods will have problems in locating 
the even multiple roots. Another problem is that both /(x) and its 
derivative f'(x) become zero at the point of multiple roots. As a 
consequence, the methods (Newton-Raphson and secant) that use 
derivatives in the denominator might face the problem of division by 
zero near the roots. 


i multiple root 
ax) UU Ji 
1j 2 3 4 — 


(a) ҚА) 2 x* - 7x? + 15x- 9 


! 
" en] 
Г 1 2 3 4 5 


(b) ҚА) = x* — 10x? + 36x? – 56x + 32 
Fig. 6.10 Graph of multiple root polynomials 
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Deflation and Synthetic Division 
We stated that a polynomial of degree n can be expressed as 
pix) = (x - x) qi) 

where x, is a root of the polynomial р(х) and q(x) is the quotient polynomial 
of degree n – 1. Once a root is found, we can use this fact to obtain а 
lower degree polynomial qx) by dividing p(x) by (x — x,) using a process 
known as synthetic division. The name “synthetic” is used because the 
quotient polynomial q(x) is obtained without actually performing the 
division. The activity of reducing the degree of a polynomial is referred 
to as deflation. й 

The quotient polynomial g(x) can be used to determine the other roots 
of p(x), because the remaining roots of p(x) are the roots of g(x). When a 
root of q(x) is found, a further deflation can be performed and the process 
can be continued until the degree is reduced to one. 

Synthetic division is performed as follows: 


Let pto = Ўах! 
ra 
and 
" LER 
чо) = X bx! 


i 
If px) = (x - x,) q(x), then 
a, Y'a, E ee ee) 
2G -x)(bua x! box + on b x + bo) (6.54) 
By comparing the coefficients of like powers of x on both the sides of 
equation (6.54), we get the following relations between them: 
а, = bya 
an-ı = ba = Xr bn 


a, = bo- xb 

ag = ~x; bo 
That is 

а=һу-х bs ienn-le0 
where b, = 5, , = 0. 
Then 


bia =a; +2, 6, (6.55) 


ee 
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Equation (6.55) suggests that we can determine the coefficients of glx) 
G.e., b, |, Б.з, ... by) from the coefficients of pix) (Le., Gp, a, 4, — ау) 
recursively. Thus, we have obtained the polynomial q(x) without 
performing any division operation. 


The polynomial equation 

P(x) = x3 – 712 + 15x -9 =0 
has a root at x = 3. Find the quotient polynomial gix) such that. 
P(x) = (x — 3) qix) i 


From р(х), we have 
ay = 1, az = -7, а = 15, and ay = -9 


Gy + by х3 
by =a, + by 87 154 (-12) 
Thus the polynomial gíx) is 


dv+3=0 
ES 


Evaluation of all real roots, including multiple roots, using Newton- 
Raphson method and synthetic division technique for deflation is 
presented in Section 6.14 


Complex Roots 


Computing complex roots is much more complex than computing real 
multiple roots. Recall that complex roots of polynomials with real 
coefficients occur in conjugate pairs. This Suggests that we should isolate 
the roots of these types by finding the appropriate quadratic factors of 
the original polynomial (rather than linear factors). Quadratic factors 
can be obtained by using the process of synthetic division. 

Let us assume that 


h(x) = x? - ux —v 
is an “approximate” quadratic factor of p(x). Then 


p(x) r(x) 
@) бк) E (6.56) 
ho ^ 49 * а) 


where q(x) is the quotient polynomial of degree (n - 2) and r(x) is the 
remainder, Note that if h(x) is an exact quadratic factor of p(x), then r(x) 
would be zero. Equation (6.56) can be rewritten as 


р(х) = glx) htx) + rx) 
= g(x) G* — ux — v) + rix) (6.57) 
Since q(x) is a quotient polynomial, it would be of the form 
qu) = b x7 b, ax + + b, (6.58) 
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Let us assume that the remainder r(x) takes the form 
rix) = By (x — u) + by (6.59) 


(The form of r(x) is chosen for the convenience of manipulation), 
The objective is to determine the factors и and v such that r(x) becomes 
zero and, therefore, A(x) becomes an exact factor of p(x) given below. 


Px) =a, 2" wa, x 5 +... ta, x eas (6.60) 


Substituting Eqs (6.58), (6.59) and (6.60) in Eq. (6.57) and comparing 
coefficients, we obtain the following relations: 


bn =а, 
ba = Oya + Ud, 
6,370, + Ub, vb, 


bı =a) + ub, +06, 
bg = ay + ub; + vb, 


(6.61) 


1 м 
where i =n, n- 1,.. 0 
bari = bng = 0 
Note that all the coefficients b; are functions of v and v which are 


unknown. 
Tt is clear that A(x) is a factor of p(x) if and only if 


| &i a, + ub, + ob, - 0 (6.62) 
by= ag + ub + vb, = 0] 
Note that Eq. (6.62) is a system of two nonlinear equations in two 


unknowns, и and v. These equations can be solved by using Newton's 
method discussed in Section 6.14. 
Once the values of u and v are known, the roots of the equation 
2 
x- ux- v 


can be easily determined using the formula 


The process can be repeated for the quotient polynomial till it becomes 
either a quadratic or linear polynomial which can be solved for their roots. 


Purification of Roots 


Purification, as the name indicates, is the process of refining the roots 
that do not satisfy the required accuracy conditions. These roots may be 
used again for testing the original problem and improving their approx- 
imntions. 
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‘The Newton-Raphson method is a popular one used for purification of 
roots, The values of the roots obtained through other methods are used 
as "initial" input values to the Newton method. 


98 MULTIPLE ROOTS BY NEWTON'S METHOD 


As discussed earlier, we can locate all real roots of a polynomial by 
repeatedly applying Newton-Raphson method and polynomial deflation 
to obtain polynomials of lower and lower degrees. Algorithm 6.9 gives а 
step-by-step procedure to achieve this. 

Note that the deflation process is performed (n — 1) limes where n is 
the degree of the given polynomial. After (n — 1) deflations, the quotient 
is a linear polynomial of type 


a xta, 70 


and therefore the final root is given by 


ао 
x, =-=— 
ay 


= 
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‚ Obtain degree and coefficients of polynomial (л and a) 
Decide an initial estimate for the first root (x9) and error criterion 


Dowhilen>1 | 
Find the root using Newton-Raphson algorithm: 


m1 
Visio da Te] 


eres 


4. Root (n) = x, 

5. Deflate the polynomial using synthetic division algorithm and make 
the factor polynomial as the new polynomial of order n — 1 

6. Set x, = x, (initial value for next root) 


Program MULTIR 


‘The program MULTIR locates all real roots of a polynorr al by repeated- 
ly applying the Newton-Raphson method as showr іп Algorithm 6.9. To 
achieve this, the program employs two subroutines: first, the subroutine 


De 
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NEWTON to find a real root of the polynomial, and second, the subrou- 
tine DFLAT to reduce the polynomial degree by one. This process is 
continued till the degree of the polynomial is reduced to one. This is 
implemented by the DO loop DO 200 1 = N, 2, -1. 

The subroutine NEWTON, while evaluating a root, also implements a 
test for accuracy of the root obtained. In case the required accuracy is 
not obtained within a specified number of iterations, the execution stops 
after giving an appropriate message. 


* Main program 
* The program finds all the real roots of 
* a polynomial Ж 


* Functions invoked 
= NIL 


Subroutines used 
NEWTON 
DFLAT 


A - Polynomial coefficients A(N«1) E 
XÜ - Initial guess 
XR - Root obtained by Newton method * 
ROOT - Root Vector * 


* 
^ 
* 
* N - Degree of polynomial 
* 
` 
` 
‚ 
* STATUS - Solution status * 


Constants used 
EPS - Error bound * 
MAXIT - Maximum iterations permitted 
REAL A,X0,XR,ROOT,EPS 
INTEGER N,MAXIT,STATUS 
PARAMETER( EPS-0.000001, MAXIT-50 ) 
DIMENSION A(11), ROOT(10) 


WRITE(*,*) 
WRITE(*,*) ' EVALUATION OF MULTIPLE ROOTS 
WRITE(*,*) 


WRITE(*,*) 'Input N, the degree of polynomial* 
READ(*,*) N 

WRITE(*,*) ‘Input poly coefficients, A(1) to A(N*1)' 
READ(*,*) (A(I), I=1, №1} 

WRITE(*,*) 'Input initial guess X' 
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READ(*,*) X0 


WRITE(*,*) 
ро 200 I = N, 2, -1 
= Find I_th root 


CALL NEWTON(N,A,X0,EPS,MAXTT, STATUS, XR) 


IF (STATUS .EQ. 2) THEN 
DO 100 J = N, 1+1, -1 


100 WRITE(*,*) 'ROOT',J,' -', ROOT(J) 
WRITE(*,*) ‘Next root does not converge in’ 
WRITE(*,*) MAXIT, ‘ iterations’ 
WRITE(*,*) 

STOP 
ENDIF 
ROOT(1) = XR 
ы Deflate the polynomial by division (Х - XR) 
CALL DFLAT(N,A,XR) 
40 = 3x 
Ы Proceed to find next root 


200 CONTINUE 


* Compute the last root 
ROOT(1) = - A(L)/A(2) 


* Write results 


WRITE(*,*) 'ROOTS OF POLYNOMIAL ARF: ' 
WRITE(*,*) 
DO 309 I = 1, N 
WRITE(*, ‘ROOT’, I,” =", ROOT(Z) 
300 CONTINUE 
WRITE(*,*) 
STOP 


SUBROUTINE NEWTON(N,A,X0, EPS, MAXIT, STATUS, XR) 
* Subroutine 
* This subroutine finds a root of the polynomial 
* using the Mewton-Raphson method 


* N - Degree of polynomial 
^ ^ - Array of polynomial coefficients 


* 
* 
* 
* 
* 
* 


* 


D 


X0 - Initial guess 


EPS - Erro: 


MAXIT - Maximum 


Output 


STATUS - Solu 


r bound 
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for a root 


n status 


iterations permitted 


XR Root obtained by Newton е 


Local Variables 


COUNT - Number of 


iterations performed 


FX - Value of polynomial function at x0 
FDX - Value of function derivative at x0 


Functions invoked 


NIL 


REAL A,X0,EPS,XR,ABS 
INTEGER N,MAXIT,STATUS 
INTRINSIC ABS 

DIMENSION А(11) 


COUNT = 1 


Compute the value of function at x0 
100 FX = AIN+1) 


DO 111 I = 


М, 1, a 


FX = FX * X0 + A(I) 
111 CONTINUE " 
* Compute the value of derivative at X0 


FDX = А(М+ 
DO 222 1 = 


1) *N 


N, 2, -1 


FDX = FDX * X0 » 
222 CONTINUE 


* Compute a root xR 


XR = X0 - 


FX/FDX 


* Test for accuracy 
ІР (ABS | (XR-X0) /XR) . LE. EPS) 


STATUS - 
RETURN 
ENDIF 


1 


* Test for convergence 


IF(COUNT .LT. MAXIT) THEN 


ҸО = XR 
COUNT = 
GOTO 100 


COUNT + 1 


AIT) 


(1-1) 


THEN 
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ELSE 
STATUS = 2 
RETURN 
ENDIF 


END 


End of subroutine NEWTON 


Subroutine 
* This subroutine reduces the degree of polynomial 
* by one using synthetic division 


* Arguments: 
^ Input 

* N - Degree of polynomial 
* А Array of coefficients of input polynomial 
* XR - A root ot the input polynomial 

* Output 

» A - coefficients of the reduced polynomial 
mime 

* Local Variables 
' 3 
* 

* 

* 

* 

* 


Functions invoked 
NIL 


RFAL A,B,XR 
INTEOER N 
DIMENSION А(11), B(i1) 

Evaluate the coefficients of the reduced polynomial 


B(N+1) = 
1011 = 
B(I) = А(1.1) + XR * B(I«1) 
CONTINUE 
Change coefficients from B array to A array 
bo 221 = 1, №1 
А(1) = B(T) 
2 CONTINUE 
RETURN 
END 


* 


P 


End of subroutine DFLAT — 


Roots of Nonlinear Equations 183 


Test Results of MULTIR The program was tested for evaluating the roots 
of the equation 


22 -8r+2=0 
The results of a test run are given below: 
EVALUATION OF MULTTPL 


ROOTS 


Tnput N, the degree of polynomial 


2 
Input poly coefficients, A(1) to A(H«1) 
2-3 1 


Input initial guess X 
0 


ROOTS OF POLYNOMIAL ARE: 


ROOT 1 = 2.0000000 
ROOT 2 = 1.0000000 


Stop - Program terminated. 


СОТ Б] COMPLEX ROOTS BY BAIRSTOW METHOD 


We have discussed in Section 6.13 that complex roots of a polynomial 
equation can be found by using its quadratic factors. We have also seen 
that if the polynomial 

1 


р(х) =a, x” +а 4x"! +... ayx + ay 
is divided by quadratic factor 
hi) =x? - ux- v 


then the result is a polynomial 
qx) = b,x? + b, x3 +. + by 
with a remainder 
r(x) = Бух - u) + by 
The values of coefficients b; are given by the following recurrence formula: 


6, =a, 
by 1 70,4 + Ub, (6.63) 
b; = a; + ubi, + vb (fori = п – 2 to 0) 
We know that in order to make A(x) an exact factor of p(x), r(x) should be 
zero. This implies that 


by = by =0 
We know from the above recurrence formula that 
by =a; ub, + obs = 0 


by = ag + ub, + vb, 


184 Numerical Methods 


The problem now is to find the solution of the system of equations 
biu ,0)=0 (6.64) 
biu ,v)- 0 
Remember, these are nonlinear equations because coefficients b, are 
functions of u and v. The strategy used to solve the system of Eqs. (6.64) 
is known as Bairstow's method. The method is similar to the Newton- 
Raphson approach for solving a two-equation system (discussed in Sec- 
tion 6.12), Using the Taylor series expansion (recall Eq. (6.51)), it can be 
Shown that 


abo ao 
Gu but ap av = bo 


To solve these equations, we need partial derivatives of 6, coefficients. 
Differentiating Eq. (6.63) with respect to u, we get 


(6.65) 


Ф, D, Dbi, 
Stab tu ae gy S ien-2to0 (6.66) 
ob, 
CO 
Dni Ф, 
= =b, +u =" b, 
For convenience, let us denote 
= Ф, 
IET 
eH but he +0 i-n-2t00 | (6.67) 
We need the following coefficients of c, 
[ 3, 
ME EL 
Ф, 
аг аб 


c, and со can be evaluated recursively using Eq. (6.67). Now, differenting 
Eq. (6.63) with respect to v, 
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(6.68) 


That is, 
а, = ud sidus ie n-2to0 (6.69) 
We need the following coefficients of d, 


Again, d; and d, can be recursively valuated using equation (6.69). 
If we compare Eqs (6.67) and (6.68), it is clear that d, values are 
identical to с; values. That is 
dice  fori=nto0 
Then, d; = с; and d, = сү. This implies that we need not compute the 
coefficients d;. 
Substituting for partial derivatives in terms of c values in Eq. (6.65) 
we get 
c; Au +c, Av = -b, 
со Au + су Av = -bo 
Then, 
Буер Вос 


2 
€i -сос, 
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сї -bico 


e? -epez 


Now, given the initial values of uo and vy, we can estimate the values of 
и and v using the following recurring relations 


bye, - doe. 

di mp (6.702) 
176062 
ei - bye, 

[EU E (6.70b) 


с? -сосз 


Note that the main task in Bairstow's method is the evaluation of b, and 
c, coefficients using the Eys (6.63) and (6.67). Algorithm 6.10 lists the 
steps to implement Bairstow's method 


Complex roots by Bairstow's method | 


1. Get polynomial parameters (л and а, values) 
2. Decide initial estimates, my and v; and stopping criterion 


[ Whien>2:Do | 


Compute 6, coefficients 
Compute с, coefficients 
Compute 

D= ск C = Gx о 
(0, хс — by» суур 
hs x € — b x D 


Dao 


у= ур + AV 
6. Төз! for accuracy of u and v. if accuracy is ok, then 
solution obtained 
go to step 8 
7. Otherwise, set 
usu 
Wav 
go to step 3 
8. Find (complex) roots of х2 - ux-v=0 
write results 
9. Set the coefficients of factor polynomial as а; 
n=n-2 
а= By.» (for i= nto 0) 
10. Set next values for и; and v, 
Uy =U 


wav 
| | End of While-Do | 


(Сота) 


у= -ајар, 
find (complex) roots 
write results 
else 
single root = -a/a, 
write results 
12, Stop 


Obtain the quadratic factor of the polynomial 
р(х) = +х+ 10 
method with starting values и = 


ay = 1, ay = 0, a, = 1, ag = 10 


Then 
b,=1 
by = ay + ub; = 0 + (41.8) x 1 = +18 
by = a; + ub, + Ubs = 1 + (41.8) (41.8) + (-401) = 0.24 
o = ag + ub, + vb 
= 10 + (41.8) (0.24) + (-4)(1.8) = 3.232 
cy =0 
21 
су = by + ис, + ves = 18 + (91.801) + (-4 x 0) = 43.6 
Cy = b, + ису +0 Co 
= 0.24 + (41.8) (43.6) + (-4 x 1) = 3.72 
D =c? - сус, = (48.67 – 3.72 х1 = 9.24 
byes -сос; 
Аи = кш aca 
(0.24) (3.6) - (3.232) x 1 
MM „0. 
9.24 anon 
boc; -b160 
ы D 


(3.232) (3.6) — (0.24) (3.72) 
Se = 1.16: 
924 1.1616 
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и = 1.8 + 0.2563 = 2.0563 

v z-4- 1.1616 = -5.1616 
Note that the true values of u and v are 2 and -5 respectively, There- 
fore, the estimated values are close to the true values. These values can 
be refined by further iterations. 


Program COMPR 

The program COMPR can locate all the real and complex roots of an 
equation. The program COMPR uses Bairstow's method to achieve this, 
The program logic is detailed in the Algorithm 6.10 and implemented as 
shown in Fig. 6.11. 


Program COMPR 


КД outeur | 
Fig. 6.11 Implementation of algorithm 6.10 to evaluate complex roots 


The subprogram INPUT obtains data for polynomial and initial val- 
ues of the quadratic coefficients. The subprogram BSTOW finds the 
quadratie factor using multivariable Newton's method and also obtains 
the reduced polynomial, The subprogram QUAD solves the quadratic 
equation, the details of which are supplied by BSTOW through the main 
program COMPR. Finally, the subroutine OUTPUT displays the roots of 
the quadratic equation. 
+ 


locates 


This program 
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* A - Array of coefficients of polynomial 

*  U0,V0 - Initial values of coefficients of the 
» quadratic factor 

* u, V - Computed values of coefficients of the 
* quadratic factor 

* B - Coefficients of the reduced polynonial 

*  X1,X2 - Roots of the quadratic factor 

+ TYPE - Type of roots (real, imaginary or equal) 


* Constants used 
PS - Error bound 


INTEGER N, TYPE 

REAL A,B,UO,VO,U, V, X1,X2, EPS, DO, D1, D2 
PARAMETER( EPS = 1.E-6 ) 

DIMENSION A(11),B(11) 

WRITE(*,*) 

WRITE(*,^) EVALUATION OF COMPLEX ROOTS’ 
WRITE(*,*) 


CALL INPUT(N,A,U0,V0) 
100 IF(N.GT.2) THEN 


* ---obtain a quadratic factor 
CALL BSTOW(N,A,B,UO,V0,U, V, EPS) 
02 = 1 
Dl = -U 
ро = -V 


find roots of the quadratic factor 
CALL QUAD(D2,D1,D0,X1,X2, TYPE) 
*----print the roots 
CALL OUTPUT (М, TYPE, X1,X2) 
*-—-get the coefficients of the factor polynomial 
N = N-2 
po 200 І = 1, N41 
A(I) = B(T+2) 
200 CONTINUE 


* 


set initial values for next quadratic factor 


00 = у 

у = у 

GOTO 100 
ENDIF 


IF(N.EQ.2) THEN 


*——polynomial is a quadratic one 
CALL QUAD(A(3),A(2),A(1).X1,X2, TYPE) 


n 


see 
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CALL OUTPUT(N,TYPE,Xl,X2) 
ELSE 
*----last root of an odd order polynomial 
ROOT - - A(1)/A(2) 
WRITE(*,*) 
WRITE(*,*) ‘Final root = *, ROOT 
WRITE(*,*) 


ENDIF 
STOP 


End of main program COMPR 


SUBROUTINE INPUT(N,A,U0,VO) 


Subroutine 


This subroutine reads polynomial details and 


initial values of the quadratic coefficients 


N - Degree of polynomial. 
A - Polynomial coefficients 
vo,vo - 

Local Variables 
NIL 


Functions invoked 
NIL 


€ * ** cow oO EO * X €x x 


REAL A,U0,VO 
INTEGER N 
DIMENSION A(11) 


WRITE(*,*) ‘Input degree of polynomial 
READ(*,*) N 


al values of the quadratic factor 


(ш), 


WRITE(*,*) ‘Input polynomial coefficients A(N+1) 


DO 11 І = N41, 1, -1 
READ(*,*) A(T) 
11 CONTINUE 
WRITE(*,*) ‘Give initial values 00 and 


to А(1)/ 


vor 
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READ(*,*) U0,VO 
RETURN 


SUBROUTINE BSTOW(N,A, B,UC, VO, TI, V, EPS) 


E 
* Subroutine * 
* This subroutine finds rhe quadratic factor using * 
* multivariable Newton's method and also finds the * 
* reduced polynomial . 


* Arguments = 
* Input * 
* N - Degree of polynomial * 
* A - Polynomial coefficients * 
*  Q0,V0 - Initial guess tor the coefficients 


® ef the quadratic factor * 
< EPS - Error bound * 
* Output = 
$ U,V - Computed coefficients of the quadratic * 

factor s 
d B - Coefficients of the reduced polynomial В 
^ * 
* Local Variables * 
» D, DELU, DELV,C * 
> x — + 
* Functions invoked = 
* — ABS * 
E aomen MH Ed * 
* Subroutines called Ё 
* NIL " 
—————— €—————— * 

INTEGER N 


REAL A,B,UC,V0,U, V, EPS, D, DELU, DELV,C 
INTRINSIC ABS 
DIMENSION A(1:), В(11), С(11) 


COUNT - 1 
100 B(N+1) = A(N+1) 
B(N) = A(N) + UO * BIN«1) 


DO 131 I = Nel, 1, =1 
B(I) = A(I) + UO * В(1+1) + VO * B(I«2) 
111 CONTINUE 


C(N+1) = 0 
C(N) = B(N+1) 
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DO 222 І = N-1, Ч, -1 
C(I) = В(1+1) + 00 * C(I«1) + VO * C(I+2) 
222 CONTINUE 


D = C(2) * c(2) - c) * c(3) 

DELU - -(B(2) * C(2) - B(1) * C(3))/D 

DELV = -(B(1) * C(2) - B(2) * C(1))/D 

U = 00 + DELU 

V = VO + DELV 

IF( ABS(DELU/U).LE.EPS .AND. ABS(DELV/V).LE.EPS ) THEN 
RETURN 

ENDIF 

IF(COUNT .LT. 100) THEN 
00 = y 


WRITE(*,*) 
WRITE(*,*) “МО CONVERGENCE IN 100 ITERATIONS’ 
WRITE(*,*) 
STOP 
ENDIF 


End of subroutine BSTOW - 


SUBROUTINE QUAD(A,B, C, X1,X2, TYPE) 


* Subroutine .* 
* This subroutine solves a quadratic equation of * 
* 2 * 
* type AX + BX « C * 


* * 
* Input * 
* A,B,C - Coefficients of the quadratic equation d 
* Output * 
* Х1,Х2 - Roots of the quadratic equation - 
* TYPE - Type of roots * 


* Functions invoked 
* SORT, ABS 
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* Subroutines called 
= NIL * 


INTEGER TYPE, IMAGE, EQUAL, UNEQUAL 
REAL A,8,C,X1,X2,SQRT,ABS 
INTRINSIC SQRT,ABS 


PARAMETER( IMAGE = 1, EQUAL - 2, UNEQL - 3) 
Q-B*B-A4*A»*c 
IF(Q.LT.0) THEN 


Roots are complex 
X] = -B/(2*A) 
X2 = SORT(ABS(Q))/(2*A) 
TYPE - IMACE 
ELSE IF(Q.EQ.0) THEN 
--Roots are real and equal 
-B/(2*A) 


Roots are real and unequai 
ХІ = (-B + SQRT(Q))/(2*A) 
X2 = (-B - SQRT(Q))/(2*A) 
TYPE - UNEQL 

ENDIF 


RETURN 
END 


End of subroutine QUAD 


* * 
* Subroutine Ы 
, This subroutine displays the roots of the 

* quadratic equation * 
. е * 
* А 
* М - Degree of the polynomial from which b 
Е the quadratic factor was obtained * 
* TYPE Type of roots " 
*  Xl,X2 - Roots of the quadratic factor * 
* Output * 
* NIL А 
TS Erie pep deter ccc et * 

* 


^ 
* 


NIL 


жии More TETTE + 
* Functions invoked * 
* NIL . 
T. ^ 
* Subroutines called * 
* NIL + 


PARAMETER( IMAGE = 1, EQUAL = 2, UNEQL = 3 } 


WRITE(*,*) 
WRITE(*,*) ‘Roots of quadratic factor at n = ',N 
WRITE(*,*) 
IF(TYPE .EQ. IMAGE) THEN 
WRITE(*,*) 'Rootl = 
WRITE(*,*) 'Root2 s ', X1, ' - ', X2,'j' 


PLSR ТР(ТҮРЕ .EQ. EQUAL) THEN 
WAITE(*,*)  "Rootl = ', Xl 
WRITE(*,*)  "Root2 = *, Xl 

ELSE 
WRITE(*,*)  "Rootl = ', Xl 
WRITE(*,*)  'Root2 = ', X2 

ENDIF 

RETURN 

END 

жо------------- End of subroutine OUTPUT ------------- * 
Test Results of COMPR 


EVALUATION OF COMPLEX ROOTS 
Input degree of polynomial (N) 
3 
Input polynomial coefficients А(М+1) to A(1) 
1 


0 

1 

10 

Give initial values 00 and VO 

1.8 -4.0 

Roots of quadratic factor at n - 3 


Rootl = 1.0000000 + 2.0000000j 
Root2 = 1.0000000 - 2.00000007 


Final root - -2.0000000 


Stop - Program terminated. 
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(eae MULLER'S METHOD 


Muller's method is an extension of the secant method. Mullers method 
uses a quadratic curve passing through three points (x, /(x,)), (xs, f(x5)) 
and (xs, f(x3)) as shown in Fig. 6.12 to estimate а root of f(x). One of the 
roots of the quadratic polynomial p(x) is taken as an approximate value 
of the root of f(x), As illustrated in Fig. 6.12, the point x,, one of the 
roots of p(x), is assumed as the next approximation for the root of F(x). 
We can write the quadratic polynomial p(x) in the form 

рх) = ау + ау (х- c) + ag (к-с)? (6.71) 
Equation (6.71) is known as the shifted-power form of the polynomial 
and c is a constant known as the centre. If we choose c = x, then 
Eq. (6.71) becomes 

рб) = ag + @y(x — xa) + ах — x3)? (6.72) 
Since x, is a root of p(x), at x = x, p(x) = 0 and, therefore, Eq. (6.72) 
becomes 

ay (x4 ~ x3)? + a, — ху) ag = 0 
Solving the quadratic equation for (x, — x4) we get 


-2 
iei] (6.73) 


X4-Xg = 
аў 


‘This is one of the forms of quadratic formula, chosen here to minimise 
error due to any subtractive cancellation. The constants ау, a, and a; 
can be obtained in terms of known function values f(x), f(x), and fixy) 
as follows: 


f(x) 


Fig. 6.12 Illustration of Muller's method 
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At x = x), xg and ху, we have 
ax, — ха)? + ax, — xg) + ag = pix) = fixi) 
вух;, — x + ai, ~ ха) + ао = pix) = fla) 
аха — х3) + ај (ку — xg) + ао = р(х) = fixa) 
Letting h; = x, — ху and hy = xs — xs, and denoting f; = f(x;), we get 
ah? +ауһу+ау=[\ 
azh? + ауз + a5 = fa 
04+0+a,=fy 
Since a, = fs, we can obtain a, and a; by solving the equations 
аз hy ka ly 7 fi fae di 
a, h? + ay hy = fa- fa = ds 
This results in 
dh? ~d,h? 
a, == 
hiha (hy - hz) 


тен dih -dshi 
27А — hy) 
Equation (6.73) can be written as 
x =x thy 
where 


-2an 


„====== 
a, +a? -Aaga 


The sign in the denominator of h, is chosen such that A, is as small in 
magnitude as possible so that x, is close to x,. That is, the magnitude 


ota; + fa? —4ауав ) should be large. 


This process is then repeated using x;, x; and x, as the initial three 
points to obtain the next approximation xs. 
Xs =x, + hg 
The process is continued till f(x) is within the specified accuracy. 
Algorithm 6.11 lists the steps in detail for computing a root by Muller's 
method 


h, 


Muller's Method 


. Decide the initial three points and stopping criterion 
Compute f, = f(x), f = f(x), & = fox) 
Compute 
h, = x% — Xy, = %2 3 
д&=һ-Ь%=Ь-Һ 


ем 


(Contd.) 


(Сота.) 
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oo 


10. 
1t. 


- Compute parameters ap, а, a; 


. Compute x, = x, + h 
. Compute f, = f(X) 
. It f(x,) satisfies the given criterion, then 


Anh 
dah? -а;һ2 
-Mha (hy а) 


a 


_ ih аә 
"һә (hy - he) 
Compute h 


(choose the sign in the denominator such that its magnitude is the 
largest. That is, if a, is positive use + sign, otherwise, — sign) 


root is obtained, 
go to step 10 
Otherwise, set 
X = Xa, My = Xa, X, = X, and 
ff „б, = fy then 
go to step 3 
Write the value of root (x,) 


Stop | 


8 — 3 


Algorithm 6.11 


Solve the Leonardo equation 


FG) = х + 2x? + 10x - 26: 0 


by Muller's method 
Let us assume the three starting points as 


Iteration 1 


x =0,%2 = 1,x,=2 
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dy = fo fa=-28 
D = hy hg (hy - hy) 
= 2-2 +1) =-2 
23-2)? -( -1)2 
a, um 23y-2) 36) =28 
-2 
(-36X-1)- (-23-2) 
LET E 


5 


Же -2x16 
28 + /282 - 4(5)(16) 


ө 
49.540659 
= -0.645934 
x4 = ху + h = 13540659 
Iteration 2 


(choosing + sign) 


z=1 

х=? 

x5 = 1.3540659 

hy = ху = ху = -0.8540659 
hy xy - ху = 0.645934 


fe 

ћ= 16 

fs = F(1.3540659) = - 0.3096797 
d, = f, - f, = - 66903202 


dy = f, - fa = 16.3096797 
2 (i, ~ hg) = 0.2287031 


а= ы-ы = 6.3540717 


aj = fa = -0.3096797 


Za = 06193594 .. 0,0145813 


e tjoi-4a,a, 4247622 


x4 =x; + h = 19686472 
This process can be continued to obtain better accuracy. The correct 
answer is 1.368808107. 


h= 
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Complex Roots 


Note that, in Example 6.17, we obtained a real root of the polynomial. In 
some cases, we may encounter complex approximations while solving 
Eq. (6.73), However, in such cases, the imaginary component will nor- 
mally be small in magnitude and it enn be neglected. 

In case we are interested in the complex roots as well, we can obtain 
these by implementing the Muller algorithm using complex arithmetic 
(which is supported by FORTRAN). 


Multiple Roots 


‘The algorithm can be modified to find more than one root by incorporat- 
ing the deflation procedure using the following equation as discussed in 
Section 6.13; 


fo) 


х-г\ 


f'G- 


Program MULLER 


Design and development of a program to implement Muller's method is 
left to the reader as an exercise, 


[5323 SUMMARY 


In this chapter, we defined various forms of nonlinear equations and 
stated а number of approaches to find the roots of such equations. We 
discussed in detail the following iterative methods to evaluate a root: 
* Bisection method (also known as interval halving method) 
* False position method (also called linear interpolation method) 
* Newton-Raphson method 
* Secant method 
* Fixed point method (also known as method of direct substitution) 
* Muller's method 
We also discussed the solution of a system of nonlinear equations using 
* Fixed point method 
* Newton-Raphson method 
We further presented two methods to find the roots of polynomials: 
* Newton-Raphson method with synthetic division 
* Bairstow's method (for real as well as complex roots) 
We discussed the process of converging of iterative methods and proved 
that 
+ Newton-Raphson method converges with order of 2 
* Bisection method converges linearly 
* False position method is linearly convergent 
* Secant method follows superlinear convergence 
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We presented FORTRAN programs and test results for the following 


methods: 
Bisection method 
False position method 


Newton Raphson method (single root) 


Fixed point method 


Newton-Raphson method (multiple roots) 
Bairstow's method (for complex roots) 


. 
. 
E 
• Secant method 
. 
D 
. 


Key Terms 


Algebraic equation 
Analytical method 
Bairstow's method 
Binary chopping method 
Bisection method 
Bracketing method 
Complex number 
Complex root 
Convergence 

Deflation 

Descartes’ rule 

Direct substitution method 
Extrapolation method 
False position method 
Fixed point equation 
Fixed point method 
Graphical method 
Halt-interval method. 
Horner's rule 
Incremental search 
Interpolation method 
Iterative function 
Iterative method 
Jacobian matrix 

Linear 

Linear interpolation 
Linearly convergent 
Monotone convergence 


Monotone divergence 
Muller's method 
Newton-Raphson formula 
Newten-Raphson method 
Nonlinear 

Open end method 
Polynomial equation 
Purification 

Quadratic convergence 
Quadratic equation 

Real root 

Regula falsi 

Repeated roots 

Roots 

Search bracket 

Secant formula 

Secant method 
Shifted-power form 
Spiral convergence 
Spiral divergence. 
Stopping criterion 
Successive approximations 
Superlinear convergence 
Synthetic division 

Trial and error 
Transcendental equation 
Zeros 


. What is a nonlinear equation? Give an example from real-life prob- 


lems. 


f. What is an algebraic equation? Give two examples. 
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3. Polynomial equations are a simple class of algebraic equations. 

Explain. 
С What is a transcendental equation? What are its characteristics? 

5, What is meant by direct analytical method of solution? What are 
its limitations? 

6. When do we seek the help of graphical method for solving a nonlin- 
ear equation? 

7. What is an iterative technique? How is it implemented on a com- 
puter? 

8) Describe the concept applied in the bracketing methods used for 
solving nonlinear equations 
How do we decide initial guess values for solving a polynomial 
equation using 

(а) open end methods, and 
(b) bracketing methods? 

10. What is meant by stopping criterion? State some of the tests that 
сап be used for terminating an iterative process. 

11, What is Horner's rule? How does it improve the accuracy of evalua- 
tion of a polynomial? 
Explain the principle of bisection method with the help of an illus- 
tration, 

Explain the principle of false position method. 

State the Newton-Raphson formula and explain how it is 

E -used to. 


t formula and the false position formula are 
milar. Then what is the difference between these two methods? 
Vi. How does the secant method compare with the Newton-Raphson 
method? 
18. Discuss the situations where the fixed-point iteration process may 
not converge to a solution. 
19. Describe an algorithm to determine all possible roots of an equa- 
tion. 
20. State the limitations of using the fixed-point approach for solving a. 
system of nonlinear equations. 
21. State the Descartes’ rule to estimate the number of real roots of a 
polynomial. 
22. What is synthetic division? How is it used to obtain the multiple 
roots of a polynomial? 
23. What is deflation? 
24. What is meant by purification of roots? How is it done? 
25. Muller’s method is an extension of secant method. Explain. 
26. Compare, in a tabular form, the order of convergence of various 
iterative methods used for solving nonlinear equations. 
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A Evaluate the following polynomials using Horner's rule: 
(a) fc) = 22 «48 2x «5. atx 23 
(b) fix -2Xy!.5:410 аіх=5 
(с) Ло) = хі + 249-16х+5 atx 
A Prove that the bisection method is linearly convergent. 
ua How would you decide the two initial values that are required for 
using the bisection method? 


Find a root. of each of the following equations using the bisection 


)e-x-2-20 
(b) sin x- 2x + 1= 04 
(c) log x - cos x = Ox 
x tan x- ae EF 
(е) x —x- 3-0: 
@ &%-ж-в=бм — 
(g) x'-z)Ó-x-320 
Derive the false position formula for evaluating a root of a nonlin- 


ar equation. = 
the false position formula repeatedly and obtain roots of the 
following equations : 


= )x-e*-0 
28 sinx-x+2=0 
(с) х? – dx? +x+6=0 
) 32 66-45-0007 С" 
(о) 4 ~ 2х-6=0м ж —~ 
Derive the Newton-Raphson iterative formula 


в [(х„) 
Fo 


кем еа. 


for solving f(x) = 0 
8. Show that the Newton-Raphson method converges to solution qua- 
dratically. 
. Obtain the Newton's iterative formula for evaluating the square 
root. of a number. Use this formula to find the square root of 3. 
10. Derive a recursive formula for finding the nth root of a number, 
say A. 
11. Show that Newton's formula for finding the reciprocal of A is 
хь = х„(2- Ах„) 
Find the Newton-Raphson formula for the following functions: 
(a) fix) =x? - 2х-1 
(b fix) =x*-x-3 
(c) fi) =x - 3x - 2 
(d) fx) = cos x 


9^ 
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(е) f(x) = хе“ 
(D f) -xtanx-1 


№23. Apply Newton’s method to бй ће roots of the following equations: 


18. 


19. 


20. 


21. 


(а) e*-x-0 
(b) log x – cos x = 0 
с) tanx- х= 0% 
(d) x- 1.5 віпх- 2.5 = 0 
‘ompute a root of each of the following equations using Newton- 


Raj 
5х+6=0, 


(b) x - 1.25 + 2x - 24 = 0, 
ж pedi er 


(d) x° + 3x5 - 2:* — 12x - 8 = 0, 
gl -%2- 102 0, DER 
iive the secant баја, How ix it different Bou ti false posi- 


tion formula. 


. Prove that the rate of convergence of secant method is better than 


that of bisection method or false position method. 


. Use the secant method to compute a root of the following equa- 
tions: 

(a) 42" – 2x-6=0 (d) &-3x-0 

(b) х?-бх+6=0 (е) x-& 42-20 

(с) xsinx- 1=0 (D х5 — 3х2 - 100 = 0 


Derive a condition under which the error in the fixed-point itera- 
tion method will decrease with each iteration. 
Use the fixed-point iteration method to evaluate a root of the equa- 
tion 
x?-x-1-0 

using the following forms of gix): 

(a) x=x -1 

(b) х=1+2х—-х? 


© х=} (143-2) 


starting with (i) x; = 1 and (ii) x; = 2. Discuss the results, 
Find the square root of 0.75 by writing f(x) = x? — 0.75 and solving 
the equation 
x =x’ 4+x-0.75 

by the method of fixed-point iteration. Assume an initial value of 
ху = -0.8. Try with an initial value of x, = 0.8. Comment on the 
results. 
Use a suitable method to find to three decimal places the roots of 
the following equations. 

(a) x* -x-6-0 

(b) х2 + 2x- 0520 

(с) x? - 10 x орх = 0 

(d) x - 2 - 3x 10-0 
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22 


‚ Solve the system of equations 
x +y=5 
12 -уё-1 


using (a) fixed-point method and (b) two equation Newton-Raphson 
method. Assume x, = 1 andy, = 1. 
. Use Newton's method to solve the following systems of equations: 
(а) Ax? ~ ay? = 1 
х? 0х +у®+2у=8 
1 and y, = 1) 


2=0 
(Assume x, = 1 and y, = 1) 


24. The polynomial 


26. 


pix) =x- 6? + 1\х-6=0 
has а root at x = 2. Find the quotient polynomial (x) such that 
P(x) = (x ~ 2) q(x) 

- A box open at the top is made from a rectangular piece of plywood 
measuring 5 by 8 metres by removing square pieces from the cor- 
ners. What will be the size of square pieces removed if the volume 
of the box is to be 20 cubic metres? 

. The supply and demand functions of a product are 

Qs = p*- 500 
Qd = р? - 60р + 1500 
Determine the market equilibrium price which occurs when 
Qs = Qd. 
» Use Muller's method to find a root of the following equations: 
(а) r'-x-2,:; 21,212 and xy = 1.4 
(b) 1+ 2r- tanz, x, = 15, х. = 14 and x, = 1.3 
. Use Bairstow's method to estimate the roots of 
Го) = х – 233 + 42 4+4 


29. In the figure shown below, estimate the angle 0 in radians (to two 


decimal places) using Newton's method (ог any other method), Area 
of triangle ABC equals area shaded. 


€ 


Also show that there is only one answer in the interval 0 and 7/2. 


30. 


31. 


32. 
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The equation x tan x — I occurs in the theory of vibrations. 
(a) How many roots does it have in the interval 0 and л/2 
(b) Estimate them to two decimal places. 
The flux equation of an iron core electric circuit is given by 
Аф) = 10 - 2.19 - 0.019" 
The steady state value of flux is obtained by solving the equation 


Г) = 0. Use a suitable method to estimate the steady state d. 
The state of an imperfect gas is given by van der Waals’ equation 


a 
(p+ )o-p=Rr 


or 
ри? — (Вр + RT) v? + av-afi- 0 
Solve the equation for v(molar volume) given the following: 
р (pressure) = 1.1 
T (temperature) = 250° К 
R (gas constant ) = 0.082 
a=36 
В = 0.043 
Use any suitable method. 


‚ Develop a program to compute all the roots of a polynomial using 


the bisection method. Use Algorithm 6.6. Test the program for 
x -6*.1x-6-0 


. Madify the above program to use Newton-Raphson method instead 


of bisection method and test the program. 


. Write a program to solve a system of nonlinear equations using 


(a) fixed-point method (Algorithm 6.7) 
(b) Newton-Raphson method (Algorithm 6.8) 


. Write a program for computing a real root of an equation using 


Muller's method. (Algorithm 6.11). 


. Modify the program in Project 4 to implement the Muller algorithm 


using complex data type supported in FORTRAN to compute com- 
plex roots, 


. Design a menu-driven program to compute a root of a given equa- 


tion. The menu will provide the choices of methods that a user can 
select, depending on the nature of equation 


Direct Solution of 
Linear Equations 


|| NEED AND SCOPE 


Analysis of linear equations is significant for a number of reasons, First, 
mathematical models of many of the real world problems are either 
linear or can be approximated reasonably well using linear relationships, 
Second, the analysis of linear relationship of variables is generally easier 
than that of nonlinear relationships. 

A linear equation involving two variables x and y has the standard 
form 


(7.1) 


Notice that the exponent (power) of variables is one. The equation becomes 
nonlinear if any of the variables has the exponent other than one. 
Similarly, equations containing terms involving a product of two variables 
are also considered nonlinear. 

Some examples of linear equations are: 


4х + ly = 15 
-x — 2/3y = 0 
Зи – 20 = -1/2 
Some examples of nonlinear equations are: 
2х-ху+у=2 
+%+у®=05 


х+ух=б 
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In practice, linear equations occur in more than two variables. A 
ear equation with n variables has the form 


ауху + 25) + @уХ + a,x, =b (7.2) 


here a; (i = 1, 2, ... п) are real numbers and at least one of them is not 
то. The main concern here is to solve for x; , 2, .. п), given the 
lues of a; and b. Note that an infinite set of x; values will satisfy the 
sove equation. There is no unique solution. If we need a unique solution 
' an equation with п variables (unknowns), then we need a set of n 
ich independent equations. This set of equations is known as system of 
multaneous equations (or simply, system of equations). 
A system of n linear equations is represented generally as 

ах + 01202 ect at = b, 

gi; + Arata + ai = by 


(7.3) 


api% + @Х +... ns = b, 
n matrix notation, Eq. (7.3) can be expressed as 
Axsb (7.4) 

(here A is ап n x n matrix, b is an n vector, and х is a vector ofn 
inknowns. 

The techniques and methods for solving systems of linear algebraic 
quations belong to two fundamentally different approaches: 

1. Elimination approach 

2. Iterative approach 

Elimination approach, also known as direct method, reduces the given 
system of equations to a form from which the solution can be obtained 
py simple substitution. We discuss the following elimination methods in 
this chapter: 

1, Basic Gauss elimination method 

2, Gauss elimination with pivoting 

3, Gauss-Jordan method 

4, LU decomposition methods 

5. Matrix inverse method 

The solution of direct methods do not contain any truncation errors. 
However, they may contain roundoff erro! ue to floating point 
operations. : 

Iterative approach, as usual, involves assumption of some initial values 
which are then refined repeatedly till they reach some accepted level of 
accuracy. Iterative methods are discussed in Chapter 8. 


Be EXISTENCE OF SOLUTION 


In solving systems of equations, we are interested in identifying values 
of the variables that satisfy all equations in the system simultaneously. 
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Given an arbitrary system of equations, it is difficult to say whether the 
system has a solution or not. Sometimes there may be a solution but it 
may not be unique. There are four possibilities: SS ae 
Ха System has a unique solut g 

№. System has no solution 

wo System has a solution but not a unique one (i.e, it has infinite 


solutions) 
P System is ill-conditioned 
45. [3x - 1.5y, 
x 2x -зу=4 Б. ?х-у=5 
| -5y-2) A 
g o| 7 X— 
(а) System with unique solution. ib) System with no solution 
X- 2y= -2 
y 2x+ Зу= 6 y| 
| ASK- 0.91y = -1 
4x- 6y - -12 „2 
we Lo | 
0 == 9 porum 
(c) System with infinite solutions (9) -conditioned system 
Fig. 7.1 Various forms of a system of two linear equations 
Unique Solution 
Consider the system " 
x*2y29 
2x-3y-4 


The system has a solution 
. x-5 and yo? 


Since no other pair of values of x and y would satisfy the equation, the 
solution is said to be unique. The system is illustrated in Fig. 7.1(a). 


Solution 
The equations 
2х-у=5 
Зх – 3/2у = 4 
have no solution. These two lines are parallel as shown in Fig. 7.1(b) 


and, therefore, they never meet. Such equations are called inconsistent. 
equations. 
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Unique Solution 
К эн» 
2х + 3у=6 
4x – ву = -12 


has many different solutions. We can see that these are two different 
forms of the same equation and, therefore, they represent the same line 
(Fig. 7.1(c)}. Such equations аге called dependent equations. 

The systems represented in Figures 7.1(b) and 7.1(с) are said to be 
singular systems 


-Conditioned Systems 


"я Ан Р Р 
Where may һе a situation where the system has a solution but it is very 


[s dee 
oues {о being singular. For example, the system 
x-3y--2 
0.45x - 091y = -1 
has a solution but it is very difficult to identify the exact point at which 
the lines intersect (Fig. 7.1(d)). Such systems are said to be ill-conditioned. 
Tll-conditioned systems are very sensitive to roundoff errors and, there- 
fore, may pose problems during computation of the solution. 
Let us consider a general form of a system of linear equations of size m x n. 
апд + аха +... + арх, = dy 
арх +@%Х; +... dos Xs = бе 


аы + X) +... + Amnn = b, 

In order to effect a unique solution, the number of equations m should 
by equal to the number of unknowns, n. If m « n, the system is said to be 
under determined and a unique solution for all unknowns is not possible. 
On the other hand, if the number of equations is larger than the number 
of unknowns, then the set is said to be over determined, and a solution 
may or may not exist. 

The system is said to be homogeneous when the constants b; are all 
zero. 


a SOLUTION BY ELIMINATION 


Elimination is a method of solving simultaneous linear equations. This 
method involves elimination of a term containing one of the-unknowns 
in all but one equation. One such step reduces the order of equations by 
one. Repeated elimination leads finally to one equation with one unknown. 
Some rules that are useful in manipulation of the equations are: 

1. An equation can be multiplied or divided by a constant. 
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2. One equation can be added or subtracted from another equation. 
3. Equations can be written in any order. 
For example, the system 


2х+у=4 
бх-2у=1 
can be written in different forms as follows: 
1. 42+ 2у =8 
Бх – 2у =1 
2. -3x + Зу= 3 
2х+у =4 
3, бк 2у =1 
2х+у =4 


Consider a general form of three linear equations: 
ад + 2X3 + 04513 = by 
азуху + Anxa + 023X3 = by (7.5) 
agi, + @8Х; + уух = b3 
We have three unknowns and three equations. Our objective is to 
modify this set to the following form: 
ах + aX + йууху = by 
айху+афрх, +0 =b4 
ах +афх +0 = bi 
This represents a new set of equations with x, eliminated in the last 
two equations. The last two equations represent a set with two unknowns. 
This system can be further transformed into the form 
ay + ау + gta = by 
айху+афх› +0 =b7 
афху+0+0 =bf 
Now, the last equation has only one unknown and, therefore, its value 
can be obtained as 


By substituting this in the second equation, we can obtain the value 
of xz. Finally, x; can be solved using the computed values of x, and x; in 
the first equation. 

Remember that the three-equation system (Eq. (7.5)) can also be 
transformed into the following form: 

ацал + арха + ах = by 
0*afz,*0 -bi 


ağız, +0+а$ухз - bi 
Note that a prime indicates that the co^ srients have been modified. 
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The elimination process basically involves the addition of multiples of 
one equatiog to other equations so as to set the coefficients of one of the 
variables iA these (other) equations to zero. Example 7.1 illustrates this 
process. 


е following system of equations by tł the process of elimination. 
ax + 2y +221057 
ZE + 3y + 2z = 14 — 
Txt y+ z= 
^ Thee FUE агаа involves the following steps: 
Step 1: Elimination of x from second and third equations 


Multiply first equation by 2/3 and subtract the result from the second 
equation. This gives 


Б/Зу + 4/32 = 22/3 
or By + 4у = 22 
Similarly, multiply first equation by 1/3 and subtract the result from 
the third equation. This gives 
iy + Be = 32 
After step 1, we have the following first derived system; 
Зх + 2у +2 = 10 
бу + 42 = 22 
у+22= B 
Step 2: Elimination of y from the third equation іл the derived system 


Multiply second equation in the derived system by 1/5 and subtract the 
result from the third. This results in 


62 = 18 
The system now has been reduced to an upper triangular form: 
Зх + 2у 42-10 
Бу + dz, 22 
62 = 18,4 
The derivation of this upper triangular system of equations is called 
the forward elimination process. 
We can now solve these equations as follows: 
2= 18/6 =3 
Then, 
by +4x3=22 
Therefore, 


у= (22-4 х 305 =2 


212 Numerical Methods 


Finally, 
3х+2х2+3=10 
x-(10- 703 = 1 
Computation of unknowns from the upper triangular system, ав illus- 
trated here, is kni ав back substitution. 


кй 


We have seen in Example 7.1 how to solve a system of three equations 
using the process of elimination. This approach can be extended to sys- 
tems with more equations. However, the numerous calculations that are 
required for larger systems make the method complex and time consum- 
ing for manual implementation. Therefore, we need to use computer- 
based techniques for solving large systems. Gaussian elimination i is one 
such technique. 

Gauss elimination method proposes a systematic strategy for reduc- 
ing the system of equations to the upper triangular form using the 
forward elimination approach and then for obtaining values of unknowns 
using the back substitutton process. The strategy, therefore, comprises 
two phases: 

1. Forward elimination phase: This phase is concerned with the ma- 
nipulation of equations in order to eliminate some unknowns from 
the equations and prodnce an upper triangular system. 

2. Back substitution phase: This, phase is concerned with the actual 
solution of the equations and uses the back substitution process on 
the reduced upper triangular system, 

Let us consider a general set of n equations in n unknowns: 


zb, 


Ag X, dg Xy +.. gs X, = by 


IC GAUSS ELIMINATION METHOD 


Gy Xptüy + ay Ty 


(7.6) 


ар Xp + Ana Xa +. жа, х, 2b, С 
Let us also assume that a solution exists and that it is unique. Algo- 


rithm 7.1 illustrates the steps involved in implementing Gauss elimina- 
tion strategy for such a general system. 


Gauss elimination (basic) method 


1. Arrange equations such that 2., 0 
2. Eliminate x, from all but the first equation. This is done as follows: 
(i) Normalise the first equation by dividing it by а,,. 
(i) Subtract from the second Eq. а times the normalised first 
equation. 


(Contd) 
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(Conta.) 


Thus, the resultant equation does not contain х,. The new 
second equation is 


0+ aja X2 t bq X, = DS 
(iii) Similarly, subtract from the third Eq. а,, times the normalised 
first equation. 
The result would be 
0+ abo Xo +..+а5х„ = b^ 
If we repeat this procedure till the nth equation is operated 
оп, we will get the following new system of equations: 
ац Ky + aya XS + Ain Xn = D 


Aig Xp ++ Abn Xn = D5 


np Xo to. t By Xp = b; 
The solution of these equations is the same as that of the 
Original equations. 
3. Eliminate x; from the third to the last equation in the new sot. 
Again, we assume that аз, #0. 
(i) Subtract from the third equationa 4g times the normalised sec- 
‘ond equation. 
(ii) Subtract from the fourth equation,az; times the normalised 
second equation, 
and so on. 
This process will continue till the last equation contains only one 
unknown, namely, хл. The final form of the equations will look like 


this: 
Ay K+ Aig Mt каҳ =b, 
арх; +..+а5Һ Xa =5 
auf 71 x, e by 077 


This process is called triangularisation. The number of primes indi- 
cate the number of times the coefficient has been modified. 


(Contd.) 


This can bo substituted back in the (n — 1)” equation to obtain the 
solution for х, _ 1. This back substitution can be continued till we get 
the solution for х. 


Note that the relation for obtaining the coefficients of the Ath derived 
system has the general form 


(7.7) 


where 

isk+1ton 

Ј = В+1ол 

aj" =a, fori=lton, j=lton 
The kth equation, which is multiplied by the factor a,, /ajy, is called the 
pivot equation and ay, is called the pivot element. The process of dividing 
the kth equation by d, /a,, is referred to as normalisation. 
Similarly, the relation for obtaining the Ath unknown ху has the general 


k-n $ h-1) 
bp- Ў ay (78) 
j=ki1 


" NM 
"ve the following 3 x 3 system using the basic Gauss elimination 
method. 


Br, + Oxy 63,7 16 , 
2x, + 4х, + Зх; = 13 / 
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After the first step of elimination using multiplication factor 2/3 and 13, 
we obtain the new system as follows: 


Зх, + 6x, + хз = M 
040472 7 
0 + 3x; + 5x, = 11 
At this point аз = 0 and, therefore, the elimination procedure breaks 
down. We need to reorder the equations as shown below: 
3x, + 6x, + xy = 16 
Эх, + Sx, = 11 
Тхз= 7 


Note that the process of elimination is complete and the solution is; 


xj21,x,72, and x; - 1 


Computational Effort 


Computational effort is one of the parameters used to decide the efficiency 
of a method. Here we estimate the computational effort required in 
terms of arithmetic operations. The number of operations required for 
eliminating x, from the equations below the Ath row are: 


Multiplications : (n ~- k + 1) (a — k) 
Subtractions : (n – k + 1) (n — k) 
Divisions ; (n - k + 1) 
The total operations required in Gauss elimination method is, therefore, 


n-l 
Multiplications = У, (n -k+ (a I inta? -» 
kal Б 
n-l 1 
Subtractions = У, (n- k+ 0(а-Ю= (п? -1) 
А-1 


Divisions = Y (n-k+1)=4n(n-1) 
РЕЙ 2 
For back substitution, we are evaluating the x values from x, to x4. 
For evaluating the value of x,, we require 
n-k multiplications 
n-k subtractions 
1 division 
Therefore, the total operations required for back substitution process 
are 


Multiplications = Y. (n- #) = iw -D 
k=l 


Subtractions = Y, ш@-Ю=ўп(л-1 
К 
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Divisions = Y 1-n 
Total operations — both the stages are given in Table 7.1. 
Table 7.1 Computational effort required 
Elimination process Substitution process Both stages 
(n- Dn (2n + 5) 


1 1 


— ГЕРИ " 
Multiplication — n(n? ~1) 320-0 © 
" i 2 2 (n~ Un (2n +5) 
Subtraction 3^ (n? - 1) 2^-0 NOTE 
) 
Division 4 pint) n ARTE 
2 2 


We can thus conclude that the number of multiplications and subtractions 
grows proportional to n*/3 and the number of divisions proportional to n*/2, 


Program LEG1 


The basic Gauss elimination technique enumerated in Algorithm 7.1 is 
implemented by the program LEGI. The driver program LEG1 uses a 
separate subprogram GAUSSI to implement the computational part of 
the algorithm. 

LEGI obtains the input data from the user and then calls the subpro- 
gram GAUSSI to solve the specified system of linear equations. It final- 
ly prints the results when they are received from the subprogram. 

The subprogram GAUSSI receives the details of the equation from 
the driver program, determines whether the pivot is zero or not, per- 
forms the elimination process (if it is not zero), computes x values (by 
back substitution), and finally sends the results to the driver program, 

Note that when the pivot value is near zero, appropriate message is 
sent to the driver to inform the user accordingly. 


* » 


* Main program 

* This program solves a system of linear equations 
* , using simple Gaussian elimination method 

A ass 

* Functions 'invoked 
+ NIL 


* Subroutines used 
* GAUSS1 
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* Variables used * 
* N - Number of equations in the system * 
* A - Matrix of coefficients * 
s B - Right siđe vector * 
* X - Solution vector ^ 

* 

. 


* Constants used 
Е STATUS - Solution status 


REAL A,B,X 
INTEGER STATUS,N 
EXTERNAL GAUSS1 
DIMENSION A(10,10), B(10), X(10) 
WRITE(*,*) 
WRITE(*,*) 'SOLUTTON BY SIMPLE GAUSS METHOD’ 
WRITE|*,*) 
WRITE(*,*) ‘What is the size of the system(n)?‘ 
READ(*,*) N 
WRITE(*,*) ‘Input coefficients a(i,j), row-wise, 
* 'one row on each line* 
ро 20 I = 1, N 
READ(*,*) (A(I,J),d=1,N) 
20 CONTINUE 


WRITE(*,*) ‘Input vector b’ 
READ(*,*) (B(I), I - 1, N) 


CALL GAUSSI(N,A,B,X, STATUS) 


IF(STATUS .NE. 0) THEN 
WRITE(*, *) 
WRITE(*,*) "SOLUTION VECTOR X' 
WRITE(*, *) 
WRITE(*,*) (X(T), I = 1, N) 
WRITE(*, *) 

ELSE 
WRITE(*,*) 
WRITE(*,*) "SINGULAR MATRIX, NO SOLUTION’ 
WRITE(*,*) ‘REORDER EQUATIONS’ 
WRITE(*, *) 

ENDIF 


STOP 
END 


- End of main program LEGI- 
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* Subroutine 
а This subroutine solves а set of п linear x 
* equations by Gauss elimination method ы 
Berar ee ==ренир rene ie as SESS UE * 
* Arguments * 
* Input * 
* N - Number of equations * 
Là A - Matrix of coefficients | 
Ы В - Right side vector ў 
* Output * 

X - Solution vector * 
* STATUS - Solution status * 
+ * 
* Local Variables 9 
* PIVOT, FACTOR, SUM 4 
оен ронро RENDERE Zi oT ^ 
* Functions invoked ^ 
* NIL Å 
-———— — oaa * 
* subroutines called М 
* NIL * 


REAL A,B,X,PIVOT,FACTOR,SUM 
INTEGER STATUS,N 
DIMENSION A(10,10), B(10), X110) 
TM Elimination begins --------------- * 


DO 33 K = 1, N-1 
PIVOT = А(К,К) 
IF(PIVOT .L’ 0.000001) THEN 
STATUS = 0 
RETURN 
ENDIF 
STATUS = 1 
DO 22 1 = К+1, N 
FACTOR = A(I,K)/PIVOT 
DO li J = Kei, N 
A(I,J) = A(I,J) - FACTOR * A(K,d) 
11 CONTINUE 
B(I) = B(I) - FACTOR * B(K) 
22 CONTINUE 
33 CONTINUE 


- Back substitution begins ------------ * 
X(N) = BIN) /AUN,N) 
DO 55 K = N-1,1,-1 

SUM = 0 

DO 44 J = K+i,N 
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SUM = SUM + A(X,J) * X(g) 
44 CONTINUE 
X(K) = (B(K) su 
55 CONTINUE 


)/A(K,K) 


RETURN 
END 


- End of subroutine GAUSSI ------- 


Test Run Results 


SOLUTION BY SIMPLE GAUSS METHOD 
What is the size of the system(n) 2 
3 
Input coefficients a(i,j), row-wise, one row on each line 
213 


o 


SOLUTION VECTOR X 
-5.000000E-001 -i.0000000 1.0000000 
Stop - Program terminated. 


M — Ó—Ó——————— — 
rad GAUSS ELIMINATION WITH PIVOTING 


In the basic Gauss elimination method, the element aj, when i = j ів 
known as a pivot element. Each row is normalised by dividing the coeffi- 
cients of that row by its pivot element. That is 
аз 
— јал 


ke 


(see Example 7.2). 

From the given set of equations, it is possible to reorder the equations 
such that a; is not zero. But subsequently, the values of ay, are contin- 
uously modified during the elimination process and, therefore, it is not 
possible to predict their values beforehand. 

The reordering of the rows is done such that a,, of the row to be 
normalised is not zero. There may be more than one non-zero values in 
the kth column below the element а,. The question is: which one of 
them is to be selected? It can be proved that roundoff errors would be 
reduced if the absolute value of the pivot element is large. Therefore, it 
is suggested that the row with zero pivot element should be interchanged 
with the row having the largest (absolute value) coefficient in that 
position. In general, the reordering of equations is done to improve accu- 
тасу, even if the pivot element is not zero. 
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The procedure of reordering involves the following steps: 
1. Search and locate the largest absolute value among the coefficients 
in the first column 
9. Exchange the first row with the row containing that element. 
3. Then eliminate the first variable in the second equation as cx- 
plained earlier 
4. When the second row becomes the pivot row, search for the coeffi- 
cients in the second column from the second row to the nth row and 
locate the largest coefficient. Exchange the second row with the 
row containing the large coefficient 
5. Continue this procedure till (п — 1) unknowns are eliminated. 
This process is referred to аз partial pivoting. There is an alternative 
scheme known as complete pivoting jn which, at each stage, the largest 
element in any of the remaining rows is used as the pivot. Figure 7.2 
illustrates the partial and complete pivoting strategies. Algorithm 7.2 
shows the implementation steps for partial pivoting. 
Complete pivoting requires a lot of overhead and, therefore, it is not 
generally used (though it may yield slightly improved numerical stabili- 
ty). 


Gauss eliminating with partial pivoting 


1. Input л, а, and b, values. 
2. Beginning from the first equation, 
(i) check for the pivot element 
(iy it it is the largest among the elements below it, obtain the 
derived system 
(iii) otherwise, identity the largest element and make it the pivot 
element 
(iv) interchange the original pivot equation with the one contain- 
ing the largest element so that 
the later becomes the new pivot equation 
(v) obtain the derived system 
(м) continue the process till the system is reduced to triangular 
form 
5. Compute x, values by back substitution 
4, Print results. 


Algorithm 7.2 | 
Bum 


Solve the following system of equations using partial pivoting tethnique 
Qe, + 2c; £39 2 6 
4x, + хз + 24 
XjQtXQ* Ху 
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Search area — 


(b) Complete pivoting 


Fig. 7.2 Pivoting strategies 


The forward elimination process using partial pivoting is shown below in 
tabular form. The process involves two steps of elimination and, in both 
the steps, the rows are interchanged. Note that the absolute value 
of -3/2 is greater than 1, 


Original system 2 2 1 6 Interchange 
© 2 3 2 


Modified original system 


First derived system 


Modified first derived system 
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Second and final derived system 4 2 3 4 


-32 14 -1 
-1/3 10/3 
The solution is Ea 
х=-10 
n=- 
pans ху=9 
Program LEG2 


Program LEG? is designed to solve a system of linear equations using 
Gauss elimination with partial pivoting. The modular structure of the 
program is shown in Fig. 7.3. 


Fig. 7.3 Modular structure of LEG2 


The master program LEG2, while reading data from the user and print- 
ing solution vector, depends on the services of the subprogram GAUS32 
for implementing the actual solution procedure given in Algorithm 72. 
GAUSS2, in turn, uses the services of two other subprograms, namely, 
ELIM, to perform forward elimination, and BSUB, to obtain the solution 
vector using the back substitution approach. 

The subprogram PIVOT undertakes the task of partial pivoting hy 
identifying the pivot element and then rearranging the rows such that 
the equation containing the pivot element becomes the pivot equation, 


to 


PROGRAM LEG2 


gram solves a system of linear equations 
ian elimination with partial pivoting 


Functions invoked 
NIL 


eee ee з + 


pea ee +» е 


20 


жже 
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Subroutines used ш 


Gauss2 


Variables used 
N - Number of equations 
A - Coefficients matrix 
B - Right side vector 
X - Solution vector 


Constants used 


INTEGER N 
EXTERNAL GAUSS2 
DIMENSION A(10,10), B(10), X(10) 


WRITE(*,*) 

WRITE(*,*) ' GAUSS METHOD WITH PARTIAL PIVOTING’ 
WRITE(*,*) 

WRITE(*,*) 'What is the size of the system(n)?' 
READ(*,*) N 

WRITE(*,*) ‘Input coefficients a(i,j), row-wise’ 
WRITE(*,*) ‘one row on each line’ 


ро 20 I = N 
READ(*, (A(I,J),J21,N) 
CONTINUE 


WRITE(*,*) ‘Input vector b’ 


READ(*,*) (B(I), I = 1, N) 


CALL GAUSS2(N,A,B,X) 
WRITE(*, *) 


WRITE(*,*) 'SOLUTION VECTOR Х' 


WRITE(*,*) 
WRITE(*,*) (X(I), I = 1, N) 
WRITE(*,*) 


STOP 


Subroutine 


This subroutine solves a system of linear 
equations using Gauss elimination method with 


partial pivoting 


> 


+ 


Arguments 
Input 
A - Coefficient, matrix 
В - Right Side vector 
N - Size of the system 
Output 
X - Solution vector 


Local Variables 
NIL 


Functions invoked 
NIL 


Subroutines called 
ELIM, BSUB 


REAL A,B, X 

INTEGER N 

EXTERNAL ELIM, BSUB 

DIMENSION A(10,10), В(10), %(10) 


Forward elimination 


CALL ELIM(N,A,B) 
Solution by back substitution 
CALL BSUB(N,A,B,X) 


RETURN 


Subroutine 
This subroutine performs forward elimination 
incorporating partial pivoting technique 


Input 
A - Coefficient matrix 
B - Right side vector 
N - System size 


À - Modified A 
B - Moditied в 


*o* жж жа 
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* Local Variables a 
* — FACTOR » 
* * 
* E 
* ап E 
А * 
*  PIVOT 
* aw S 
REAL А,В, X, FACTOR 
INTEGER N 
EXTERNAL PIVOT 
DIMENSION A(10,10),B(10) 
DO 33 K = 1, N-1 
CALL PIVOT (N,A,B,K) 
DO 22 I = Kel, N 
FACTOR А{Т,К)/А(К,К) 
DO 11 J = Kei, N 
А(Т,д) = А(1,7) - FACTOR * A(R,J) 
11 CONTINUE 
B(I) = B(I) - FACTOR * B(x) 
22 CONTINUE 
33  CONZINUE 
RETURN 
END 
* * 
" * 
SUBROUTINE PIVOT(N,A,B,K) 
aOR Ere RE a + 
* Subroutine вә 
This subroutine performs the task of partial * 
* pivoting Кеа. of equations) е 
m asana 4 
* Arguments * 
* Input * 
* N - System size * 
bi А - Coefficients matrix bi 
$ B - Right side vector * 
Li К - Row under consideration for pivoting 
* Qutput + 
n A - Modified A (after pivoting) * 
* B - Moditied B (af * 
* чё» > * 
* Local Variables ч 


s LARGE, TEMP, P * 
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se eee 


* 


11 


B 


22 


Functions invoked 
ABS 

Subroutines called 
NIL 


REAL LARGE, TEMP, A, B 
INTEGER P,N,K 

INTRINSIC ABS 

DIMENSION A(10,10), B(10) 


Find pivot P 


Peek 
LARGE = ABS(A(K,K)) 
DO 11 I = Ks], N 
IF(ABS(A(I,K)) .GT. LARGE) THEN 
LARGE - ABS(A(I,K)) 
Pr 
ENDIF 
CONTINUE 
Exchange rows P and К 
IF(P.NE.K) THEN 


DO 22 J = K,N 
TEMP = А(Р,Ј) 


A(P,J) = A(K,d) 
A(K,J) = TEMP 
CONTINUE 
TEMP - B(P) 
B(P) = B(K) 
B(K) = TEMP 


ENDIF 


Subroutine 
This subroutine obtains the solution vector X 
by back substitution 


Arguments 
Input 
N - System size 
A - Coefficient matrix (after elimination) 


B - Right side vector (after elimination) 


dium 
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* Output * 
€ X - Solution vector * 
* es pi а * 
* Local Variables * 
SUM * 

* 

* Functions invoked * 
* NIL * 
* * 
* Subroutines called * 
NIL * 


INTEGER N 

REAL A,B,X, SUM 

DIMENSION A(10,10), B(10), x{10) 

X(N) = ВОМ) /A(N,N) 

DO 55 К < №1, 1, -i 

SUM = 0 

DO 4¢ J = Kel, N 

SUM = SUM 4 A(K,J) * XW) 

44 CONTINUE 

X(K) = (BIK) SUM) /һ(к,к) 
55 CONTINUE 

RETURN 


Bren oes End of subroutine BSUB -- 


Test Run Results 


ЧОР WITE PARTIAL PIVOTING 
What is the of the system(n)? 
3 

Input 


onc 


+i),  row-wise 
or each 


SOLUTION 
5.0000000 1.0000000 -6.0000000 
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Ene GAUSS-JORDAN METHOD 


Gauss-Jordan method is another popular method used for solving a sys- 
tem of linear equations. Like Gauss elimination method, Gauss-Jordan 
method also uses the process of elimination of variables, but there is a 
major difference between them. In Gauss elimination method, a variable 
is eliminated from the rows below the pivot equation. But in Gauss- 
Jordan method, it is eliminated from all other rows (both below and 
above). This process thus eliminates all the off-diagonal terms producing 
a diagonal matrix rather than a triangular matrix. Further, all rows are 
normalised by dividing them by their pivot elements. This is illustrated 
in Fig. 7.4. Consequently, we can obtain the values of unknowns directly 
from the b vector, without employing back-substitution. Algorithm 7.3 
enumerates the Gauss-Jordan elimination steps. 


Ё аю ап |[ x1] 


bi] 
ал 42 аз || хг |=|b2 
m 
ан аг аз [fy] bi] 
0 а» ag || xo 


аз аз аз |! х» | 
d р. о ox] 
=j bi Jo тох 1-х 
0 0 ag јіхз | [55 10 0 Ix, bg 


Result of Gauss elimination Result о! Gauss-Jordan elimination 
ig. 7.4 Comparison of Gauss and Gauss-Jordan methods of elimination 


Fi 


- Normalise the first equation by dividing it by its pivot element. 

. Eliminate x, term from all the other equations. 

- Now, normalise the second equation by dividing it by its pivot element. 
Eliminate x, from all the equations, above and below the normalised 
pivotal equation. 

. Repeat this process until x, is eliminated from all but the last equation. 

. The resultant b vector is the solution vector. 


Algorithm 7.3 l 


Pena 


a 


2 


The Gauss-Jordan method requires approximately 50 per cent more arith- 
metic operations compared to Gauss method. Therefore, this method is 
rarely used- 
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x, + Зх. + х3 = 10 
2x. — 4x, 2x4 = -12 
using Gauss-Jordan method. 
Step 1: Normalise the first equation by 
The result is: 
ху + 2x — Bxy=-4 
xy + 3x3 + xg = 10 
2хү-4х,-2ху=-12 / 
Step 2: Eliminate x, from the second equation, subtracting 1 time the 
first equation from it. Similarly, eliminate x, from the third equa- 


tion by subtracting 2 times the first equation from it. The result 
is: 


хү+?җ -%у=-4 
Fag+ 4ху= 14 
0 - 8x, + 4х3=-4 

Step 3: Normalise the second equation. (Note that it is already in 


normalised form.) 
Step 4: Following similar approach, eliminate x, from first and third 
equations. This gives ^ 


zı +0- 11x, 7-32 + 
Ox,*4x,714 , \ 
0 + O + 36x, = 108 ) 
Step 5: Normalise the third equation = 
x,*0- Ixy = -82 
O + x, +4x5=14 
0+0 +23=3 
Step 6: Fliminate x; from the first and second equations, We get 
xy) +0+0=1 
0+х.+0= 2 
0+0+х3=3 


Computational Effort 
‘The Gauss-Jordan method requires only the elimination process. To elim- 
inate x, from all but the kth equation, we need to undertake the follow- 
ing tasks: 
1, Divide the coefficients xj, ;, x,,» -.. x, and b, by the coefficient of ху. 
2. Subtract suitable multiples of the kth equation from the otber 
(n — 1) equations to eliminate x, from these equations. 
These tasks require: 
(п-А+1 divisions 
(n —1)(л — k + 1) multiplications 
(n — 1) (n —Ё +1) subtractions 


230 Numerical Methods 


Therefore, the total operations required in order to complete the elimi- 
nation process are: 


Multiplications = Y (n—1)(n—&+1)=2n(n? —1) 
fet 2 
Subtractions = È (n+ D=2n(n? +1) 
kel 


Divisions = È (n-D(n-k+ D=Åa(n- 1) 
hel 


We see that the number of multiplications and subtractions is approxi- 
mately equal to (1/2) n? and the number of divisions is (1/2) n”. Compu- 
tational efforts required by the Gauss and Gauss-Jordan methods are 
given in Table 7.2. 


Table 7.2 Comparison of computational effort 


Ss Gauss method — + Gauss Jordan method) 
Multiplication dna l,s 
3 2 
Subtraction i^ 3 n3 
Divisions 


It shows that the Gauss method requires only two-third of the num- 
ber of multiplications or subtractions that the Gauss-Jordan method 


е TRIANGULAR FACTORISATION METHODS 


The coefficient matrix A of a system of linear equations can be factorised 
(or decomposed) into two triangular matrices L and U such that 


A=LU (7.9) 
where 
ly 0 . 0 
£s in 1а E 0 
а lan 
and 
Un Un o Uy 
U- Di Жи xs s 
0 0 u 
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Lis known as lower triangular matrix and U is known as upper triangu- 
lar matrix. 
Once А is factoriseu into L and U, the system of equations 
Ах =Ь 
сап be expressed as follows 
(LU) x=6b 
or 
L(Ur-b (7.10) 
Let us assume that 
Ux=z (7.11) 
where z is an unknown vector. Substituting Eq. (7.11) in equation 


(7.10), we get 
Iz=6 (7.12) 


Now we can solve the system 


Ax=b 
in two stages: 
1. Solve the equation 
Lz=b 


for z by forward substitution 
2. Solve the equation 
U,=2 
for x using z (found in stage 1) by bak substitution. 

The elements of L and U can be determined by comparing the ele- 
ments of the product of L and U with those of A. The process produces a 
system of n? equations with п? + л unknowns (l; and m) and, therefore, 
L and U are not unique. In order to produce unique factors, we should 
reduce the number of unknowns by n. 

This is doae by assuming the diagonal elements of L or U to be unity. 
The decomposition with L having unit diagonal values is called the 
Dolittle LU decomposition while the other one with U having unit diago- 
nal elements is called the Crout LU decomposition. 


Dolittle Algorithm 
We can solve for the components of L and U, given A, as follows: 
A=LU 
implies that 
ay aly uy tl кыщ fori <j (7.13) 


ay = Ty ty + hauyt—thjug fori=j (7.14) 
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aya ly uy slug. slyuy  fori>j (115) 
where шщ, = 0 fori > j and ly = 0 for i <j 


The Dolittle algorithm assumes that all the diagonal elements of L 
are unity. That is 
ы=1, 1,2, л. 


Using equations (7.13), (7.14) and (7.15), we can successively deter- 
mine the elements of U and L as follows: 


Ifisj 
m 
и Dla иу ј=1,2,..п 
кл 


where шц = ац, y; = Фу йз = 053 
Similarly, if i >j 


1 [ га! 1 
hy nu, roo 7 өй» | js1,2,.i-1 


where]; 71471421 and  l,-aj/ujfori-2ton. 

Note that, for computing any element, we need the values of elements 
in the previous columns as well as the values of elements in the column 
above that element, as illustrated in Fig. 7.5. This suggests that we 
should compute the elements, column by column from left to right within 
each column from top to bottom. 


1 ur thg 
2 иң 
3 
4 иа 
Row 
5 p 
6 ца 
|һ ün Un 
8 | bi k ko ы Б be | и 


Fig. 7.5 Pictorial view of Dolittle algorithm of LU decomposition 
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Algorithm 7.4 lists steps involved in LU decomposition and its applica- 
tion to the solution of lincur equations. 


‚ There is no need to store 1’s on the diagonal of L matrix. 
. There is also no need to store 0's of L or О. Consequently, 
the values of L can be stored in the zero space of U. 
3. Further, each element of. ау is used only once (and never 
used again). 
It is clear that we can “overwrite” A with L and U and save 
memory. This means “+e corresponding lj or ш can be stored in 
the location of aj. 


[^ Dolittle LU decomposition and solution 


1. Given n, А,Ь 

2. Set uz ay forj=1ton 
Set for /=1ton 
Set |, = а/ш, for i=2ton 

З. For each j = 2 to n do: 


(0 For i= 210] 
1-1 

Compute и, =ау - Y lau 
[ЕП 


Repeat / 
(il) Fori=j+1 ton 


1 


Compute J, 
Repeat / 
Repeat / 
4. Set 2, = b, 
5. Fori=2ton 


1-1 | 
-xja - кн | 
1 а 


1-1 
Set sum = Yljz; 
1 


7. Рогі= n-11o1 


Set sum= Yujx; 
js 
Set x= (z,— sum) / ш, 
Repeat / 
8. Write results 


Algorithm 7.4 
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Solve the system 
Зх + 2x, + xa 7 10 


z Bare ale 


Factorisation — 
Fori=1, l; = Land 
uu 70,73 
чы = ay, =2 
ugz0,-71 
Fori=2 


and  i;-1 


EUM иа =3-®х9=5 


3 
„4 
3 
Fori=3 
ay _1 
итв 
225 -lnu 
lay изу 
.2-M3x2 4 
|. 8/5 6 
lgz1 
ug - 7g Ша ~ lyg uas 
2444.24 
=34 Mich MIC 
Thus, we have 
1 0 0 
=|2/3 1 0 
V3 4/5 1 


3 2 1 
=|0 5/3 4/3 
0 0 24/15 
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Forward Substitution 
Solving Lz = b by forward substitution, we get 
z,=6,=10 


29 = by - ly 21 
= 14-23 x 10 = 22/3 
By why = ГУРУ 
214-3 x 10- 45x 22/3 =T 
Back Substitution 
Solving 10, = 2 Бу bach  bstitution, we get 


_ 27/15 
Хз 24/15 
DETTM 
hago 
_ 022/3) - (4/3) x 3 
Е 5/3 е 
x арша iic i 
un 
.10-2x2-1x3 | 
= 5 = 
Program DOLIT 


The Dolittle LU decomposition method for solving a system of linear 
equations may be implemented on a computer using the program DOLIT. 
The DOLIT program solves a problem with the help of two subprograms, 
LUD and SOLVE. 

The subprogram LUD decomposes the given coefficient matrix using 
the Dolittle algorithm and the resultant L and U matrices are supplied 
back to the main program DOLIT. Note that when it fails to decompose 
the matrix, a message to that effect is sent to the main program for 
necessary action. 

The subprogram SOLVE receives the right side vector B and the 
decomposed matrices L and U from the main program and then obtains 
the solution vector X employing both the forward and backward substi- 
tution techniques. 


* 


Main program 
This program solves a system of limear equations 
using Dolittle LU decomposition 


* Subroutines used 
* LUD, SOLVE 


* Variables used 

* N - System size 

* A - Coefficient matrix of the system 
* В - Right side vector 

* L - Lower triangular matrix 

* U - Upper triangular matrix 

* FACT - Factorisation status 


* Constants used 

bd YES,NO 

-—— — — 
INTEGER N,YES,NO,FACT 
REAL A,U,L,B,X 
EXTERNAL LUD, SOLVE 
PARAMETER( YES = 1, NO = 0 ) 
DIMENSION A(10,10),U(20,10),L(10,10),B(10),X(10) 
WRITE(*, *) 
WRITE(*,*) ‘SOLUTION BY DOLITTLE METHOD * 
WRITE(*,*) 

* Read input data 


WRITE(*,*) ‘What is the size of A?’ 
READ (*, *) N 
WRITE(*,*) ‘Input coefficients a(i,j), row-wise, 
+ ‘one row on each line’ 
po 10 T= 1, N 
READ(*,*) (A(I,J), J-1,N) 
10 CONTINUE 
WRITE(*,*) 'Input vector B on one line" 
READ(*,*) (B(I), I-1, N) 
* LU factorisation 


CALL LUD(N,A,U,L,FACT) 
IF( FACT .EQ. YES ) THEN 


* Print LU matrices 
* Print matrix U 
WRITE(*, *) 


WRITE(*,*) ‘MATRIX U’ 
DO 20 I = 1,N 
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WRITE(^,111) (U(I,J),J-1,N) 
20 CONTINUE 


* print matrix L 
WRITE(*,*) 
WRITE(*,*) ‘MATRIX L' 
DO 30 I-1,N 
WRITE(*,111) (L(I,J),J-1,N 
30 CONTINUE 
ELSE 
WRITE(*,*) 
WRITE(*,*) 'FACTORISATION NOT POSSIBLE’ 
WRITE(*,*) 
STOP 
ENDIF 
* Solve for X 
CALL SOLVE(N,U,L,B,X) 
WRITE(*,*) 
WRITE(*,*) 'SOLUTION VECTOR X' 
WRITE(*,*) 
,WRITE(*,111) (X(T), I-1,N) 
WRITE(*, *) 
111 FORMAT (3F15.6) 


STOP 


Subroutine = 
* This subroutine decomposes the matrix A into , 
*  L and U matrices using Dolittle algorithm » 
E E 
* Arguments А 
* input * 
* N - System size T 
* A - Coefficient matrix of the original system 
* Output е 
» U - Decomposed upper triangular matrix $ 
* L - Decomposed lower triangular matrix $ 
* FACT - Fact about decomposition (уез ог по) * 


* Local Variables 
* SUM : 
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* Functions invoked 
9 NIL 


* Subroutines called 


* NIL 
————————X MÁÁÁ—— M рк nie 


INTEGER N,YES,NO,FACT 

REAL A,U,L,SUM 

PARAMETER( YES = 1, NO = 0 ) 

DIMENSION A(10,10), U(10,10), L(10,10) 
* Initialise U and L matrices 


DO 1 I = 1,N 
= 1 


DO 1 J N 
U(I,J) = 0.0 
L(r,J) = 0.0 

1 CONTINUE 


* Compute the elements of U and L 


ПО 10 J а 1,N 
001,71) = ACL) 
10 CONTINUE 


DO 20 I = 1,N 
L(,I) = 1.0 
20 CONTINUE 


DO 30 I = 2,N 
L(1,1) = A(I,1)/0(1,1) 
10 CONTINUE 
DO 100 Т = 2,N 
DO 50 I = 2,0 
SUM = A(T,J) 
Do 40 K = 1,1-1 
SUM = SUM - L{I,K) * U(K,J) 
40 CONTINUE 
U(I,J) - SUM 
50 CONTINUE 


IF( U(J,J) .LE. 1.E-6 ) THEN 
FACT - NO 
RETURN 
ENDIF 
DO 70 I = J+1,N 
SUM - A(I,J) 
DO 60 К = 1,1-1 
SUM = SUM - L(I,K) * U(K,d) 
60 CONTINUE 
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L(I,J) = SUM/U(I,d) 
70 CONTINUE 
100 CONTINUE 


FACT = YES 


RETURN 
END 


End BE i subroutine LUD-------------- * 


Subroutine 
* his subroutine obtains the solution vector X 
Ф using the coefficients of L and U matrices 


* 

* 

* 

* 
* Arguments * 
* Input ж 
ы N - System size м 
* u - Upper triangular matrix ы 
* L - Lower triangular matrix Ы 
* B - Right side vector Ы 
* Output Y 
* X - Solution vector А 
. 
* 
* 


Local variates 
SUM, Z(vector) £ 


Functions 
NIL 


жаз ее 


NIL 


М 
М 

, " 

+ subroutines called 
А 

* 


INTEGER N 
REAL U,L,SUM,3,%,Z 
DIMENSION U(10,10),L(10,10),B(10) ,X(10), z(10) 


* Forward substitution 
2(1) = B1) 
DO 20 I = 2,N 
SUM 0.0 


DO 10 J = 1,1-1 
SUM = SUM + L(1,J) * Z(J) 
10 CONTINUE 
241) = B(I) - SUM 
20 CONTINUE 
* Back substitution 


X(N) = Z(N)/U(N,N) 


SUM = 0.0 
DO 30 J = I+1,N 
SUM = SUM + U(I,J) * Х(Ј) 
30 CONTINUE 
X(I) = (Z(1) - SUM)/U(I,T) 
40 CONTINUE 
RETURN 
END 
* we _ End of subroutine SOLVE ------ ------ * 


Test Run Results 
SOLUTION BY DOLITTLE METHOD 
What is the size of A? 


3 
Input coefficients a(i,j), row-wise, one row on each line 


аъ 


232 
13 3 
Input vector B on one line 
10 14 14 
MATRIX 0 
3.000000 2.000000 1.000000 
.000000 1.666667 1.333333 
.000000 .000000 1.600000 
MATRIX 1, 
1.000000 .000000 000000 
-666667 1.000000 ‚000000 
.333333 .800000 1.000000 
SOLUTION VECTOR X 
1.000000 2.000000 3.000000 


Stop - Program terminated. 


Crout Algorithm 

Another approach to LU decomposition is Crout algorithm. As men- 
tioned earlier, Crout decomposition algorithm assumes unit diagonal 
values for U matrix and the diagonal elements of L matrix may assume 
any values as shown below. 


lu 0б. ОЙТ xg = an] [ай ёа > ty 
in шз... OF 0 1 . ug -|n аљ +. а 


fa aw wd D GO |ы. йш ж Gag 
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We can use an approach that is similar to the one used in Dolittle 
decomposition to evaluate the elements of L and U. 


Cholesky Method 


In case A is symmetric, the 11] decomposition can be modified so that 
the upper factor is the transpose of the lower one (or vice versa), That is, 
we can factorise A as 


A=LL 
or 
A-U'U (7.16) 


Just as for Dolittle decomposition, by multiplying the terms of Eq. (7.16) 
and setting them equal to each other (see Eqs (7.13), (7.14) and (7.15)), 
the following recurrence relations can be obtained. 


Уи? @=1 - 


i 4 
e | aan 


if i-l T ү 
шу=——|а,- Xuyuy| @>® 
ket J 


mi 


This decomposition is called the Cholesky's factorisation or the method 
of square roots. Algorithm 7.5 lists the basic steps for computing the 
elements U, column by column. 


Cholesky's factorisation | 


1. Given л, A 


2. Set uy, = Jan 
8. Set uj s aj! Us foris2ton 
4. Forj=2ton 
Fori =2toj 
amat 
Forkettoi-1 
Sum = sum = Uy Uy 


Repeat k 
set Uy = SUM / Ug ití«j 
set и, = sum iti=j 
Repeat / 
Repeat j 


5. End of factorisation 
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Factorise the matrix 


12 3 
2 8 22 
3 22 82 
using Cholesky’s algorithm _ _ 
Fori=1, according Eq. (7.17) 
uy, =V1=1 
an 2 
„=-!#®=®=2 
ы Sn 1 
EP 
=F 3 
Fori=2 
шае 823 узип 
ир 
Богі = 3 


изз = as ~ 42, -u 
= 4829—64 = V9 =3 


Thus, we have 


3 


1. 2 
=|0 2 
0 0 


ada 


7 И ROUNDOFF ERRORS AND REFINEMENT 


In all the direct methods, only one estimate of x, is produced. As we 
know, methods use a large number of floating point operations and, 
therefore, introduce roundoff errors in the final solution. We have no 
indication how accurate the solution is. 

One way to check this is to substitute the answer back into the 
original equations to see whether a substantial error lias occurred. In 
case the error is beyond the acceptable limit, the solution can be im- 
proved by a lechniqve known as iterative refinement. 
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Let us suppose х'!'! is the solution of the system 
Ax-b 
Substituting x” back in the original equation, we get 
Ax = у 
^! is not exact, b' is not equal to b. If we define 
ШЕГШЕ 


Since x' 


then we have 
r? z AX 1 (7.18) 
where r is known as residual vector. If we can use this information to 


compute the error, then we can correct the approximate solution with 
this error. 


If we assume that x* is the exact solution and e is the error in x, then 
x* = 60 
ог 
ax se (1.19) 
Substituting this in Eq. (7.18), we get 
бш А (eh е) 


= Ах* + Ag b 
Since Ax*= b, this results in 
PES (7.20) 


We can now obtain e” 


improved solution as 


hy solving Eq. (7.20) and then estimate the next 


dhasaa 
If we need further improvement, we can repeat the process by calculat- 
ing e using 
Аг — 2 
where 


т = Ax? b 
We get the next estimate as 
a =х® um 


This process can be repeated as many times as we wish to achieve a 
desired accuracy. Algorithm 75 lists the steps for implementing the 
iterative refinement process. 


Iterative refinement 


1. Obtain LU factorisation of A 
2. Compule the solution x by forward and back substitutions 
3. Find the residual vector r using 

r-Ax-b 
4. Compute the error using 


=r 
by forward and back substitutions 
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5. Setx=x-e@ 
6. It e is sufficiently small 


stop 
otherwise 
go to step 3 


Е ILL-CONDITIONED SYSTEMS 


As pointed out in the beginning of the chapter, arriving at a proper 
solution depends on the condition of the system. Systems where small 
changes in the coefficient result in large deviations in the solution are 
said to he ill-conditioned systems. A wide range of answers can satisfy 
such equations. This means that a completely erroneous set of answers 
may produce zero (or near zero) residuals. This is illustrated in Example 
л. 

Tll-conditioned systems are very sensitive to roundoff errors. These 
errors during computing process may induce small changes in the coeffi- 
cients which, in turn, may result in a large error in the solution. 

‘We can decide the condition of a system either graphically or mathe- 
matically. Graphically, if two lines appear almost parallel, then we can 
say the system is ill-conditioned, since it is hard (о decide just, at which 
point they intersect. 

The problem of ill-condition can be mathematically described as fol- 
lows: consider a two equation system 


ag X +ах = b 
вз ху + ag Xo = ba 


If these two lines are almost parallel, their slopes must by nearly equal. 
That is 


Alternatively, 


ацаз = 01:025 


ог 
—w-—: 740^ ри. 
Lomas men © 9j 


Note that à; 235 — а ар is the determinant of the cyefficient matrix 


A [t а] 
[an an 
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This shows that the determinant of an ill-conditioned system is very 
small or nearly equal to zero, 

In partial pivoting technique, we try to interchange the rows so that 
the largest element becomes the pivot element. This is done basically to 
avoid a division by zero or nearly zero point. Even the largest element in 
that column may happen to be zero (or nearly zero). Such situations 
arise when the systems are ill-conditioned. Solution of these systems 
may not be meaningful. 


Solve the following equations 
21, + х = 25 
2.0015, + x; = 25.01 
апа d thereby discuss the effect of ill- conditioning. 


235x1-9501x1- 
2x1-2001x1 


_ 25.01 х2 = 25 х 2.001 = 

харх C 
Let us change the coefficient of x, in the second equation to 2.0005. Now 
the values of x, and x; are 


25-2501 , 
*172-2,0005 7 

.2501x2-25x20008 15 
a 2- 2.0005 


Compare the results. A small change in one of the coefficients has re- 
sulted in a large change in the result. 
If we substitute these values back into the equations, we get the residuals 
ав 
= 40 – 15 – 25 =0 
л = 40.02 — 15 — 25.01 = 0.01 
‘The first equation is satisfied exactly and the residual of the second is 
small. It appears as if the results are correct. This illustrates the effect 
of roundoff errors on ill-conditioned systems. 


we MATRIX INVERSION METHOD 


Another way to obtain the solution of an equation of type 
Ar=b (7.21) 


246 Numerical Methods 


is by using matrix algebra. Multiply each side of Eq. (7.21) by the 
inverse of A. This yields 


ATAx=A1S (7.22) 
since Аг! A = I, the identity mat. ation (7.22) becomes 


AB (7.23) 


Equation (7.23) gives the solution for x. 
This approach becomes useful when we need to solve Eq. (7.21) for 
different sets of b values while A remains the same. 


Computing Matrix Inverse 


Although the Gauss-Jordan method is more complicated compared to 
Gauss elimination method, this method provides a simple approach for 
obtaining the inverse of a matrix. 
This is done as fullows: 
1. Augment the coefficient matrix A with an identity matrix ав shown 
below: 


ац ар aGg:l 0 0 
an аш а ‘010 
аһ am аз!0 0 1 


2. Apply the Gauss-Jordan method to the augmented matrix to re- 
duce A to an identity matrix. The result will be as shown below: 


1 0 Oiaj ais ац 
0 1 0:a5 ah aj 
10 0 ilah ар аз] 
The right-hand side of the augmented matrix is the inverse of A. Now, 
we can obtain the solution as follows: 
x*,-auj xb, + ig xb, +аң xb, 
Xo =ађ хь +а x by tais x bs 


хз =аң xb, +а% xb, каф x ba 


Condition Number 


The inverse matrix can also be used to decide whether a system is ill- 
conditioned, Let us define a matrix C as 
C=A.A1 (7.24) 


If C is close to identity matrix, then the system is well-conditioned. If 
not, it indicates ill-conditioning. 
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Equation (7.24) can be expressed using the concept of matrix norm as 
follows: 


cond (А) = | JA] | - | JA? || (7.25) 


where cond(A) is called the condition number and || А || is the “norm” 
of the matrix A. The norm is defined as follows 


ПА max. Ea, | 


This is known as row-sum norm. In this norm, the sum of the absolute 
values of the elements for each row is computed and the largest of these 
is taken as the norm. 

The smaller the condition number, the better is matrix A suited to 
numerical computation, 


TERR SUMMARY 


Tn this chapter we studied systems of linear equations. Among Ше two 
popular approaches available for solving these equations, we considered 
the elimination (also known as direct) methods in detail. They includo: 

+ Gauss elimination method (basic) 

+ Gauss elimination with pivoting 

- Gauss-Jordan method 

+ LU decomposition method using Dolittle algorithm 

+ Matrix inverse method 
We also stated that other LU decomposition techniques, such as Crout 
algorithm and Cholesky’s factorisation, may be applied to solve the equa- 
tions. 

Direct methods introduce roundoff errors. We presented an iterative 
refinement procedure for improving the final result. 

Computer programs with test results have been given for the follow- 
ing methods: 

+ Basic Gauss elimination method 

+ Gauss elimination with partial pivoting 

- Dolittle LU decomposition method 


Back substitution Lower triangular matrix 
Basic Gauss’ elimination LU decomposition 
Cholesky's algorithm Matrix inversion 
Cholesky's factorisation Matrix norm 

Complete pivoting Method of square roots 
Condition number Modular structure 
Crout algorithm Nonlinear 

Crout LU decomposition Normalisation 


(Contd.) 
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(Сота) 

Decomposition Over-determined 

Dependent equations Partial pivoting 

Direct method Pivot element 

Dolittle LU decomposition Pivot equation 

Elimination approach Pivoting 

Forward elimination Residual vector 

Gauss elimination Row-sum norm 

Gauss-Jordan method Simultaneous equations 
eous equations Singular systems 

lil-conditoned system Triangularisation 

Inconsistent equations Under-determined 

Infinite solutions Unique solution 

Iterative refinement Upper triangular matrix 

Linear Zero residuals 


и the two basic approaches that are employed for solving a 
system of linear equations. 
wt. What are the four possible solution conditions of a system of linear 
equations? Explain each one of them with an illustration. 
3. Explain under-determined and over-determined systems. 
4,-What is meant by homogenous equations? 
4. State some basic rules that are used in the elimination method of 
solving simultaneous linear equations. 
«c Explain the basic concepts used in the Gausa elimination approach 
7. What is triangularisation of equations? How does it help obtain the 
solution? 
8. What is pivoting? Distinguish between partial pivoting and com- 
plete pivoting. 
9. How does pivoting improve accuracy of solution? 
10. Compare critically Gauss elimination and Gauss-Jordan methods 
of solving simultaneous equations 
11. Show that Gauss-Jordan takes about 50% more operations than 
Gauss elimination for the case of three equations. 
12, What is Dolittle decomposition? How is it different from Crout de- 
composition? 
13. What is Cholesky's factorisation? 
14. What is iterative refinement? How is it used to improve the accura- 
cy of results? 
MÉ. What is meant by ill-conditioned systems? 
16, Can we solve an ill-conditioned system? If yes, how? 
17, What is condition number of a system? How is it computed? 
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Га a 


4 
ay olve the following twstem of equations using simple eliminatio. 
фәр! в: 
Lo ш х+у+г= 6 
2r-y432- 4 


4x + бу – 102 = 13 
Show that the following system of equations has no solution. 
“F +у+а-12 of la. $265 >opla % 


& p ska U Tm Ravn 
* Show that the following system of equations has infinite number o? 
solutions. 
х+у+г= 20 V 
2х-Зу+г=—5 
3x -2y +22 = 15 | 
J ' Polve the following systems of equations by simple Gauss elimina- 
a tion. 


Br, + Bry + Ary = 5 

eM Ar + dxa +x, -6 
* ail om, =7 
(b) 2r, + 8x, 44x, =Б 

Зк + 45x, + 5х, = 6 


Ax, + x; + бху = 
Xi + 2x + 35) =8 
2x, + dxa + Ox, =8 


JL 4x, + + 2x, -2 
5. Solve the systems in Exercise 4 using partial pivoting. 


sets of equations. 
(a) Zu +2 + xa- 22, =0 
4x, *Zx 4x zB 


8rn4254230 —-7 

Xp Bx, 2r. =3 
(b) x, (x, ~ 2x, =3 

Ax, ~ 2x x; Ei 


3x – х2 + 3ra = 
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“8. Solve the following systems of equations by Gauss-Jordan method 
x + 2х. – Sry = 4 
NN 2x, + 4x, — бхз = 8 
SS хр- 2x, + 0x, =4 
(b) 2r; 4x?4X9 =7 
4x, + Dey + x = 4 
Xj X) X, =0 
9. Find the Dolittle LU decompositions of the coefficient matrices of 
the systems in Exercises 7 and 8. 
10. Solve the systems in Exercises 7 and 8 using the matrices L and U 
4 found in exercise 9 by forward and backward substitutions. 


Find the Cholesky decomposition of the matrix 


4 11 
15 2 
13259 


12. Find the inverse of the following matrices using Gauss-Jordan elim- 
ination technique 


2 34 1 2 -8] 
(а) j4 2 3 (|2 4 -6 
342 i22 3 


18. Find the condition numbers of the coefficient matrices of systems in 
Exercise 4. 

14. Consider the following electrical network connecting six resistors 
and two batteries: 


Ohm's law states that the voltage across a resistor equals the cur- 
rent through it multiplied by its resistance. Using this law, we can 
set up the following equations: 

Rely + ВЗ, — Ip) + RAI,- Ig) = V, 

Rly + Belle ~ 19 + Ry, - 1) = V, 

Ril; + Ry; — 1) + Rol, - I) = 0 


15. 
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Assuming R, = R, = Ё = 2, R, = R, = Ё = 3 and V, = V3 = 5, Solve 
the system of equations for currents /,, Г, and I4 using Gauss elim- 
ination өт Gauss-Jordan method. 

A company produces four different products. They are processed 
through four different departments A, В, C and D. The table below 
gue the number of hours that each department spends on each 
product 


, - 
Department. | = Producis 

[Fi Р, Ps P, 

DI | 2 3 Xx | 3 

D2 H 2 2 4 

D3 | 3 4 4 5 

D4 3 2 2 3 


Total production hours available each month in each department is 
as follows: 


| Department 


Hours 


Formulate the appropriate system of lincar equations to determine 
the quantities of the four products that can be produced in each 
month, so that all the hours available in all departments are fully 
utilised. Determine how much time each department spends for 
each product. 


. Program LEG2 solves a system of linear equations using Gauss 


elimination with partial pivoting. Modify the program to imple- 
ment complete pivoting. 


. Develop a program to factorise a matrix using Cholesky's algo- 


rithm, 


. Design and develop a program to implement the Gauss-Jordan elim- 


ination method for solving a system of linear equations. 


. Write a program to implement the Crout decomposition solution of 


linear equations. 


. Construct a program to implement the iterative refinement process 


as given in Algorithm 7.6. 


Iterative Solution of 
Linear Equations 


Fd NEED AND SCOPE 


Direct methods discussed in the previous chapter pose some problems 
when the systems grow larger or when most of the coefficients are zero 
They require prohibitively large number of floating point operations 
and, therefore, not only become time consuming but also severely affect 
the accuracy of the solution due to roundoff errors. In such cases, itera- 
tive methods provide an alternative. For instance, ill-conditioned sys 
{етв can be solved by iterative methods without facing the problem of 
roundoff errors. 

"The following three iterative methods are discussed in this chapter: 

4G Jacobi iteration method 
С Gauss-Seidel iteration method 

3. Successive over relaxation method 
Like all other iterative processes, these methods introduce truncation 
errors and, therefore, it is important to understand the magnitude of 
this error as well as the rate of convergence of the iteration process. 


СЕ JACOB! ITERATION METHOD 


Jacobi method is one of the simple iterative methods. The basic idea 

behind this method is essentially the same as that for the fixed point 

method discussed in Chapter 6, Recall that an equation of the form 
fix)=0 

can be rearranged into a form 
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The function g(x) сап be evaluated iteratively using an initial approxi- 
mation x as follows: 


FEET for i=0, 1, 2... 


Jacobi method extends this idea to a system of equations. It is a direct 
Substitution method where the values of unknowns are improved by 
substitutiag directly the previous values. 

o us consider a system of n equations in n unknowns. 


ац Xp + yg ty +... +O yy X, = b, 


y Xp Oy ty +... + gy x, = by Ў 


Fd 


3 & 
ам Xy + а Xo +... + Ong х, = D, A 


e 


(8.1) 


We rewrite the original system as 


м^ 


ИЛ = (x, +ауху + 
ац 


+ах„) 
x eeni 


bo —(азх +азху +... + aan xn) 
xQ2—— H1 en a 
" an 


(8.2) 


(аах + Ogg ty +. Жад) 


^ 
ам 


x, 


Now, we can compute Xy Xy. x, by using initial guesses for these 
values. These new values are again used to compute the next set of x 
values. The process can continue till we obtain a desired level of accu- 
racy in the x values. 

In general, an iteration for х; can be obtained from the ith equation as 
follows 


b, -(a x) ca, yx? +а е И se 
a сы" ш-1^,-1 diclo iul * i Dax, 
ares (8.3) 
ay 


The computational steps of Jacobi iteration process are given in 
Algorithm 8.1. 
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Jacobi iteration method 


Obtain n, a, and b, valuc_. 


D 

2. м bla; tor i= 1. n 
3. Set 0 

4 


For 
(i) 
dil) 


Set sum = sum — a; Xoj 
Repeat j 
(iii) Set x, = sum/a, 
(iv) if key = 0 then 
Xi-X 
: E > error then 


set key = 1 
Repeat í 
5. If key = 1 then 
sel Xo = X, 
go to step-3 


6. Write results 
4 a. ae 
M 


м2 


Obtain the solution of the following system using the Jacobi iteration 
method 


2x, XQ + X 5 
Bx, + Sry + 2x3 = 15 
_2х„+х +4 = В 


First, solve the equations for unknowns on the diagonal. That i is 


5-xj-x 
wy oa. 
15- 2x. 
m Л " 
8-22 -X2 
Aa-—À 


If we assume the initial values of x), x; and x4 tc be zero, then we get 
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(Note that these values are nothing butx! = b/a,) 
For the second iteration, we have 
5-3-2 
T—- 
а - 
а) 15-3х2.5-2х2 3 24 
„йылы айыы ы 


(2) 
х2 2 0 


„215-880-200 


8 
E ке E ius 

After fourth iteration, 
209 „8-84-18 о 


i 2 
б) _ 15-3«2.15-2х1.825 
gu шш 
w 8-2x215-3 


с ^ =0.175 YQ 


‘The process can be continued till the values of x reach a desired level of 
accuracy. 


= 1225 


Program JACIT 


The program JACIT solves a system of n linear equations using the 
Jacobi iteration method as detailed in Algorithm 8.1. The main program 
reads interactively the system specifications and displays the results on 
the screen. The solution algorithm is implemented through the subrou- 
tine JACOBI. 
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The subprogram JACOBI, while computing the solution vector X, tests 
for the accuracy as well as the convergence. The computing process 
stops either when the desired accuracy is achieved or when the process 
does not converge within a sp..... * .umber of iterations. 


PROGRAM JACIT 


* 
* Main program * 
* This program uses the subprogram JACOBI to solve * 
* a system of equations by Jacobi iteration method * 
* 

- 

* 

* 

* Subroutines used Ы 
* — JACOBI * 
* 

* Variables used Ы 
* А - Coefficient matrix * 
» B - Right side vector * 
* N - System size * 
* X - Solution vector * 
* COUNT - Number of iterations completed Ы 
* STATUS - Convergence status * 
^ 

* Constants used 

© EPS - Error bound * 
* MAXIT - Maximum iterations permitted * 


REAL A,B,X,EPS 

INTEGER N,COUNT,MAXIT, STATUS 
PARAMETER(EPS-0.000001,MAXIT - 50) 
DIMENSION A(10,10), B(10), X(10) 


WRITE(*,*) 

WRITE(*,*) ‘SOLUTION BY JACOBI ITERATION‘ 
WRITE(*, *) 

WRITE(*,*) ‘What is the size of the system(n)?' 
READ(*,*) N 

WRITE(*,*) ‘Input coefficients a(i,j), row-wise', 
WRITE(*,*) ‘one row on each line’ 


DO 20 I = 9, N 
READ(*,*). (A(I,J),J-1,N) 
20 CONTINUE 


WRITE(*,*) ‘Input vector b‘ 
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READ(*,*) (B(I), I = 1, N) 
CALL JACOBI (N,A,B,X,EPS, COUNT, MAXIT, STATUS) 
IF(STATUS .EQ.2) THEN 


WRITE(*, *) 
WRITE(*,*) ‘NO CONVERGENCE IN’, MAXIT, 
‘ITERATIONS’ 

WRITE(*, *) 

ELSE 
WRITE(*,*) 
WRITE(*,*) ‘SOLUTION VECTOR X' 
WRITE(*, *) 
WRITE(*,*) (X(I), I = 1, N) 
WRITE(*,*) 
WRITE(*,*) ‘ITERATIONS = ',COUNT 
WRITE(*, *) 


SUBROUTINE JACOBI (М,А, B, X, EPS, COUNT, MAXIT, STATUS) 

Subroutine * 
This subroutine solves a system of n linear 
equations using the Jacobi iteration method = 


Arguments = 
Input в 
М - Number of equations id 

А - Matrix of coefficients of the equations * 

В - Right side vector T 
EPS - Error bound ж 
МАХІТ - Maximum iterations allowed е 
Output d 
X - Solution vector * 
COUNT - Number of iterations done Ы 

* 


STATUS - Convergence status 


Local Variables ы 
XO, SUM * 


Subroutines called 
NIL 


INTEGER N,KEY, COUNT, MAXIT, STATUS 
REAL A,B,X,XO,EPS 

DOUBLE PRECISION SUM 

INTRINSIC ARS 

DIMENSION A(10,10),8(10),X(10),XO(10) 


* Initial values of X 
DO 10 I-1,N 
KO(I) = B(I)/A(I,I) 
90 — CONTINUE 
COUNT - 1 
99 KEY = 0 
* Computing values of X(I) 
DO 30 I = 1,N 
SUM = BII) 
DO 20 J - 1,N 
TF(I.50.J) GOTO 20 
SUM = SUM - A(I,J) * XO(J! 
20 CONTINUE 
X(1) = SUM/A(T,T) 
IF(KEY .EQ. 0) THEN 
* Testing for accuracy 
IF(ABS((X(I) XO(I))/X(1)) .GT. EPS) THEN 
KEY = 1 
ENDIF 
ENDIF 
30 CONTINUE 
TF(KEY.FQ.1) THEN 
* Testing for convergence 
IF(COUNT .zQ. MAXIT) THEN 


STATUS - 2 
RETURN 
ELSE 

STATUS - 1 


DO 40 I = 1,N 
XO(I) = X(1) 
40 CONTINUE 
ENDIF 
COUNT = COUNT+1 
GO TO 11 


ENDIF 
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Test Run Results The program was used to solve the following system of 
equations: 
Br, +x) =5 
x; - x, 
The interactive computer output is given below: 


SOLUTION BY JACOBI TTERATION 
What is the size of the system(n)? 


Input coefficients ali 
one row on each line 
31 

3 «3 

Input vector b 

55 

SOLUTION VECTOR X 
2.0000000 -9.999998Е-001 
"IONS - 14 


), row-wise 


Program terminated. 
Now, rearrange the equations as shown below and then use program 
JACIT to solve the system. 

x- 8x, = 5 

x, + x)= 6 
The output now is as given below: 

SOLUTION BY JACOBI I'TERATTON 
What is the size of the system(n)? 


input coefficients a(i, jl, row-wise 
one row on each line 

1-3 

34 

Input b 

NO CONVERGENCE IN 50 ITERATIONS 
Stop - Program terminated. 


Note that the same two equations, when their positions are interchanged, 


do not produce required results even after 50 iterations. Convergence is 
discussed in Section 8.5, 


[8] GAUSS-stIDEL METHOD - 


Gauss-Seidel method is an improved version of Jacobi iteration method. 


In Jacobi method, we begin with the initia] values 
BOR 


and obtain next approximation 
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m 0 w 
жү aE 


Note that, in computing z$”, we usedx( and notx{” which has just 


been computed. Since, at this point, bothx!” andx{\” are available, we 


w 


can usex;” which is a better approximation for computing x‘), Simi- 
1 2 


larly, for computing x}", we сап usex|" andx\” along with AP ca sm. 


This idea can be extended to all subsequent computations. This ap- 
proach is called the Gauss-Seidel method. 

The Gauss-Seidel method uses the most recent values of x as soon as 
they become available at any point of iteration process. During the (k+1)th 
iteration of Gauss-Seidel method, x, takes the form 


m mm "m m 
"TONO (апат +..-+04-101-1 Жашо аца ) 
„ = —| (84) 


1 
i Um 
When i = 1, all superscripts in the right-hand side become (k) only. 


Similarly, when i = n, all become (k + 1), Figure 8.1 illustrates pictorially 
the difference between the Jacobi and Gauss-Seidel method. 


= (ах аз а, 
(b.- dey X1 аз хуа eration 1 
(bs — аң X, = а„ хаз 


à LALA LPS 
% = (blaax casses oration 2 


% = (b~ an х ар хаз 21] 
(a) Jacobi method 


® = (by = aye X- ayy уа, 
= (bp - ay X ~ аз куа, Meration 1 
= (by ayy XI — аз уам 


= (5-85 аз 3а, Негайоп 2 
= (6-а х- Bag хаз 


(0) Gauss-Seidel method 
Fig. 8.1 Comparison of Jacobi and Gauss-Seidel methods 


[3] 
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Jbtain the solution of the following system using Gauss-Seidel iteration 
method 


m 


2ху+ху+ху =5 
Эх, + 5x, + 2x, = 15 


xy = (5 = xq - xgV/2 
xq = (15 - 8x, — 2/5 
xy = (8- 2x, - x94 
Assuming initial value as x, = 0, x; = 0, and ху = 


Iteration 1 x, =(5-0- 0y2 225 
ay = (15-3 х 25— 0/5 215 
X4: (8-2 25 — 1.514 = 0.4 (rounded to one decimal) 
Iteration 2. x, = (5 - 1.5 - 0.402 =16 
192(18-3«16-2x0.)5 =1.9 
хз=(8-2х16-1.9у4 20.7 
We can continue this process until we get x, = 1.0, x; = 2.0 and xy = 1.0 


(correct answers) 


Algorithm 
Gauss-Seidel algorithm is a simple modification of the algorithm of the 


Jacobi method. Note that once a new value ofz/^ ' " has been calculated 


and compared with the previous values of x\*!, the previous value is no 
longer required and, therefore, the previous value can be replaced by the 
new one. This implies that we need not use two vectors (one to store 
previous values and another to store new values) for storing x values. 
We need to use only one vector x that stores always the latest values of 
x, This is illustrated in Algorithm 8.2 


Gauss-Seidel method 


Obtain n, a; and b, values 
. Set x, = b/a; foriziton 
Set key = 
. Foris1ton 
(i) Set sum = 5, 
(i) Рог/= 1 юп (/ ù 
Set sum = sum — a, ху 
Repeat j 


sans 


(Contd.) 
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(Contd.) 


(i) Set dummy = sum / a, 
(iv) If key = 0 then 


dummy - x, 
it ed > error then 
| dummy 


set key = 1 
(v) Set х= dummy 
Repeat / 
5. If key = 1 then 
go to step 3 
6. Write results 


Program GASIT 


Like JACIT, the program GASIT also solves a system of n linear equa- 
tions but employs the Gauss-Seidel iteration method as detailed in Algo- 
rithm 8.2. The iteration algorithm is implemented with the help of a 
subprogram called GASEID. 


PROGRAM GASIT 


fx ——— йыла — 
* Main program * 
* This program uses the subprogram GASEID to solve a * 
F system of equations by Gauss-Seidel iteration method * 
* Functions invoked le 
* NIL * 
e E e RE ERE RE E en a NT & 
* Subroutines used Ы 
* — GASEID * 
* Variables used * 
* А - Coefficient matrix * 
* — B - Right side vector * 
* N- System size * 
Ы Х - Solution vector Ы 
* COUNT - Number of iterations completed * 
* 


STATUS - Convergence status * 
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* Constants used * 
" EPS - Error bound * 
MAXIT - Maximum iterations permitted * 


REAL A,B,X,EPS 

INTEGER N,COUNT,MAXIT,STATUS 
PARAMETER(EPS-0.000001,MAXIT-50) 
DIMENSION A(10,10), B(10), X(10) 


WRITE(*,*) 

WRITE(*,*) 'SOLUTION BY GAUSS-SEIDEL ITERATION' 
WRITE(*,*) 

WRITE (* ‘What is the size cf the system(n)?' 
READ(*,*) N 

ЖИТТЕ (*, *) "Input coefficients a(i,j), row-wise' 
WRITE(*,*) ‘one row on each line’ 


DO 20 I = 1,N 
READ(*,*) (A(I,J),J-1,N) 
20 CONTINUE 
WRITE(*, *) "Input vector b' 
READ(*,*) (B(t), I = 1, № 


CALL GASEID(N,A,B,X, EPS, COUNT, MAXIT, STATUS) 


.ЕО. 2) THEN 
*) 
.*) ‘NO CONVERGENCE IN’, MAXIT, 
‘ITERATIONS’ 
WRITE (*,*) 
ELSE 
WRITE (*, *) " 
WRITE(*,*) 'SOLUTION VECTOR X' 
WRITE(*,*) 
WRITE(*,*) (X(T), I = 1, N) 
WRITE(*,*) 
WRITE(*,*) 'ITERATIONS - ', COUNT 
WRITE(*,*) 
ENDIF 
STOP 


END 
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* Subroutine 
* This subroutine solves a system of linear 
* equations using Gauss-Seidel iteration algorithm 


* Arguments 
* Input 

* N - Number of equations 
ыд A - Coefficient matrix 

* B - Right side vector 

* EPS - Error bound 

* МАХТТ - Maximum iterations allowed 

* Output 

* X - Solution vector 

ш COUNT - Number of iterations done 
* STATUS - Status of convergence | 


* Local Variables $ 


* 


» DUMMY, SUM, KEY * 
Ф эзе EL ANDEREN * 
* Functions invoked * 
d ABS * 
рро T EE EL Li a 
* Subroutines called * 
» NIL * 


INTECER N,KEY, COUNT, MAXIT, STATUS 

RFAL A, B, X, EPS, DUMMY 

DOUBLE PRECTSION SUM 

DIMENSION ^(10,10) ,B(10),x (10), xo(10) 
INTRINSIC ABS 


* Initial values of x 4 
DO 10 I = 1, 
X(T) = BII) у AG, I) 
10 CONTINUE 


. COUNT = 1 
11 KEY = 0 


* Computing X(I) values 
DO 30 I = LN 
SUM = B(I) 
DO 20 J = 1N 
IF(I.EQ.J) GOTO 20 
SUM = SUM - А(Т,7) * x(gj 
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20 CONTINUE 


DUMMY = SUM/A(I,T) 
IF(KEY .EQ. 0) THEN 


* Testing for accuracy 
IF(ABS[(DUMMY - X(1))/DUMMY) .GT. EPS) ‘THEN 
KEY = 1 
ENDIF 
ENDIF 
X(1) - DUMMY 


30 CONTINUE 


TF(KEY .EQ. 1) THEN 
* Testing for convergence 


IF (COUNT 
STAT 
RETURN 


-Е0. MAXIT) THEN 
2 


= COUNT + 1 
бото 11 
ENDLF 
ENDI 
RETURN 
END 
*------------ End of subrout GASEID------ T * 


Test Run Results The program was used to solve two different sets of 
equations and the results are as follows: 


SOLUTION BY GAUSS-SEIDEL ITERATION 
What is the size of the system(n)? 


Input Coetfic 
one row on sach 
113 

447 

259 

Input vector b 
113 

SOLUTION VECTOR X 

-4.999992E-001 -9.999992Е-001 9.999993E-001 


ITERATIONS - 38 
Stop - Progr 


ts a(1,j), row-wise 
line 


termi; cd 
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Second sec 
SOLUTION BY GAUSS-SEIDEL ITERATION 

What is the size of the system(n) 
3 
Input coefficients ali,j), row-wise 
one row on each line 
7 630 
3 30 0 
2 28 10 
Input vector b 
13.3 3.9 6.9 


NO CON' 


GENCE IN 50 


Stop - Program terminated, 


ea METHOD OF RELAXATION 


Relaxation method represents a slightly modified version of the Gauss- 
Seidel method. The modification is aimed at faster convergence, The 
basie idea is to take the change produced in а Gauss-Seidel iteration 
step and extrapolate the new value by a factor r of this change. The new 
relaxation value is given by 


w den 


= rx axl) 


2m! qq pate 


(8.5) 


The parameter r is called the relaxation parameter. This step is ap- 
plied “successively” to each component of vector x during iteration pro- 
cess and, therefore, the method is known as successive relaxation method. 

The parameter r may be assigned a value between 0 and 2. We have 
the following possibilities: 

O<r<l under-relaxation 


1 no relaxation (ху ^ = x'D) 


l«r«2 over-relaxation 


For values of r between 1 and 2, an extra weight is placed on the 
present value and Eq. (8.5) really represents ап extrapolation. The 
intention here is to push the estimate closer to the solution. This method, 
when 1 < г < 2, is popularly known as successive over-relaxation (or 
SOR) method. It is also known as simultaneous over-relaxation method. 

The SOR technique can be easily implemented by a simple modifica- 
tion of the Gauss-Seidel algorithm. The relaxation value is obtained 
using Eq. (8.5) at the end of evaluation of each value of x. The extrapolated 
value becomes the new value of x for the next cycle. Equation (8.5) can 
be simply implemented as 
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+0 
Е 


г) х (8.6) 


+ А 
‘That is, the old value of x,"*” is replaced by the new value of a 1. 
е implementation of this step is shown in Algorithm 8.3. 
The choice of value of г depends on the problem and is often decided 
empirically. 


SOR method 


Algorithm is the same as Algorithm 8.2, except the statement 
(iii) Set dummy = suma; 
is replaced by a pair of statements 
Set dummy = sunva, 
Set dummy - rx dummy « (1 -rnx 


Algorithm 8.3 


Ё CONVERGENCE OF ITERATION METHODS 


Condition for Convergence 


We know that the iteration methods presented here are based on the 
basic idea of the fixed point method discussed in Chapter 6. We have 
shown that sufficient condition for convergence for solving one non- 
linear equation is 

IG'G)| <1 
and for two nonlinear equations, Mx, y) and G(x, y), аге 


Ge 


lar] { 

БЫ E SED 
aP| |26 

B = i bad 


These conditions apply to linear equations as well. Therefore, we can 
use these conditions in the Jacobi and Gauss-Seidel iteration methods. 

Fer the sake of simplicity, let us consider a two-equation linear 
system. We can express the Gauss-Seidel algorithm as follows: 


xy = Fo xy) a (b; - aq) 


(8.9) 
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z - 1 - 
x, = GG, х2) = — (bz -an x1) 
an 
Ja. Su. (8.10) 


ад Um 
The partial derivatives of these equations are 


and 


Substituting these values in Eqs (8.7) and (8.8), we get 


fen. 1 and caje 
а би 
This means that 
lanl > lay! (8.11) 
and 
lanl > |а2,1 (8.12) 


That is, the absolute value of diagonal element must bc greator than 
that of the off-diagonal element for each row. 

The above derivation can be extended to a general system of n equa- 
tions to show that. 


(8.13) 


For each row, the absolute value of the diagonal element should be 
greater than the sum of absolute values of the other elements in the 
equation, Remember that this condition is sufficient, but not necessary, 
for convergence. Some systems may converge even if this condition is 
not satisfied. 

Systems that satisfy the condition Eq. (8.13) are called diagonally 
dominant systems. Convergence of such systems are guaranteed. 


Rate of Convergence 
Consider the iterative Eqs (8.9) and (8.10). At (k+1)th iteration, we have 
rh Q5 0m. (8.14) 
"xu Ux 
«+з abe an юз (8.15) 
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Substituting forx(**” in Eq. (8.15), we get 


by b 
aera cm -2a [5 2] (8.16) 
аз Gy |an ац 


Similarly, we have 


gen EAE -muare] (8.17) 
в ац Gn 
Subtracting Eq. (8.16) from Eq. (8.17), we get 
kaa) _ Chew _ 01201 (ket _ g 
х5 x aun «у x”) 
Tf we denote the errors as 
Then 
(8.18) 


should have the coefficients such that 
aran гү 
аа 

This also conforms with Eqs (8.11) and (8.12). 


(8.19) 


Solve the equations 
Bx, +x, =5 
%-8%=5 
by the Gauss-Seidel method 
First, we rearrange the equations in the form. 


ху 0305 — xa) 
xq = 1/8(x, - 5) 
Assuming initial values asx? = 0, andx? = 0 


xi = 5/3 
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Remember, the new value of x, should be used in the calculation of new 
Xz. Therefore 


Similarly, 


i _ 1/5, 10) _ 55 
si «354-5 
The table below shows the values of x, and x, rounded to 4 decimal 
places. 


Iteration n 2, True error in x, | True error in x, 
0 ~ 0.0000 0.0000 2.0000 | 1.0000 
1 1.6667 inn 0.3333 0.1111 
2 20370 | -0.9877 0.0370 0.0123 
3 1.9959 | -10014 | 0001 | 00014 
4 2.0005 | -0.9999 | 0.0005 0.0001 
5 20000 | -10000 | 0.0000 | 0.0000 
The process converges to the solution (x, = 2, ху = -1) in five itera- 


tions. Note that the given system is diagonally dominant. The conver- 
gence is graphically illustrated in Fig. 8.2 


іх 


х -35-5 


xf? = 5/3 (stan) 


Diverging 


% 
Na 


Fig. 8.2 Pictorial representation of Gauss-Seidel convergence 
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_ Бава 


Solve the equations 
21 ~ 3x, = 
3с, +2, 
by the Gauss-Seidel method 


5 


Note that the system contains the same two equations as in Example 
8.3, except they are interchanged, 
The iterative equations are 


x, =5 + Jrg 
%, = 5 – Bx, 

As before, we start withx? = 0 andx? = 0. Then, 
х апі — xj?- -10 
x =-25 and x! = во 


x 2245 and x! = 730 


It is clear that the process does not converge towards the solution. Rather, 
it diverges (see Fig. 8.2). The result will be the same even if we start 
with the initial values very close to the solution (except the solution 


itself), Readers may try withx? = 2.5 andx® = -1.9. 


From Examples 8.3 and 8.4 we observe the following: 
1. Iteration process converges when 
a: 


E 2m 
аш аш] 


2. The process does not converge for the same set of equations when 
their order is changed. That is, when 


<1 and 


азау | 1 
апау; 
the process does not converge 
3. When it converges, the errors in x; and x, decrease by a factor of 


211022 
ааз 


at each iteration 
4. Stronger the diagonal elements, faster the convergence. 
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ERES SUMMARY 


Iterative methods provide an alternative to the direct methods for solv- 
ing linear equations. These methods are particularly suitable for solving 
ill-conditioned systems. We considered the following three iterative meth- 
ods: 

e Jacobi method 

• Gauss-Seidel method 

Successive Over Relaxation (SOR) method 
We also presented FORTRAN programs along with test results for the 
Jacobi and Gauss-Seidel methods, 

We have shown that a sufficient condition for convergence is that, for 
each row, the absolute value of the diagonal element should be greater 
than the sum of absolute values of the other elements in the equation. 


Key Terms 
Diagonally dominant system Relaxation parameter 
Gauss-Seidel iteration Successive over relaxation 
Jacobi iteration Successive relaxation method 


State the two popular approaches available for solving а system of 
~ — linear equations. 
What are the limitations and pitfalls of using direct methods for 
"d solving a system of linear equationa? 
O State the two important factors that are to be considered while 
applying iterative methods. 
© The basic idea behind the Jacobi iterative method is essentially the 
same as that of fixed point method used for solving nonlinear equa- 
tions. Explain. 
Pi Gauss-Seidel method is similar in principle to Jacobi method. Then, 
what is the difference between them? 
6. Show that, for а two-equation system 


ах + ауаз = by 
азху + ах; = by 
а sufficient condition for convergence of the iteration process is 


Gutala 

antz 
72 Explain the basic concept used in the relaxation method. 
В. What is relaxation parameter? 


iterative Solution of Linear eoo @ 


9. What is meant by over-relaxation and under-relaxation? 
10. Give an algorithm for solving a system of linear equations using 
the successive over-relaxation (SOR) method. 


lolve the set of equations given below by Jacobi method. 
B Bey бп, + Age 15 
4хү-х,+г=2 
хү- Bay + 72 = 22 


jolve the system of equations 


Qe -y+22=6 
e. 2х-у+г=3 


х+3у-2= 4 


by using Jacobi method. 
уе the systems given in Exercises 1 and 2 by Gauss-Seidel itera- 
fion. Compare the rate vf convergence in both the cases. 

^ Solve the pair equations 

/ t 21,25 

Зс + 


by applying Jacobi method to the equations 
/ хү=Б-2х, 
/ х= 5-35 
/ Observe the divergence. 
4. Solve the equations in Exercise 4 applying Gauss-Seidel method. 
Fi Compare the divergence with that of earlier one. 
6. Interchange the order of equations given in Exercise 4 and then 
solve them 
Aa) using Jacobi method 
(b) using Gauss-Seidel method 
Compare the convergence. 
/1. Solve the system of equations 
Ж 


/ Bx, 2х3= 5 
=, +2x,)-x,= 0 
-2x,4x32-1 
by applying 
(a) Jacobi method 
(b) Gauss-Seidel method, and 


(с) Successive over-relaxation method with г = 1.4 
Comment on the results. 
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8. Solve the following equations by Gauss-Seidel method 


2x – Ty - 102 = -17 
Бх+у+Зг= 14 
х+10у+Ье= 7 
Assume suitable initial values. 


. Monthly faculty salary in three departments of an institute is given 


below. Assuming that the salary for a particular category is same 
in all the departments, calculate the salary of each category of 
faculty. 


Department Number of Faculty — Total Salary 
Professor Asst. ProfessorLecturer (їл '000) 
А 2 UR оош o | 
B 3 1 2 50 
с 1 4 3 60 


. Mr. Ram has invested а sum of Rs 20,000 in three types of fixed 


deposits with an interest rate of 10%, 11% and 12%. He earns an 
annual interest of Rs 2,220 from all the three types of deposits. If 
sum of the amounts with 11% and 12% interest rates is four times 
the amount carning 10% interest, what is the amount invested in 
each type. 


Develop a menu-driven, user-friendly single program which pro- 
vides options for using either Jacobi method or Gauss-Seidel method. 


. Modify the Gauss-Seidel iteration program to incorporate the suc- 


cessive over relaxation method to improve the speed of conver- 
gence, 


Curve Fitting: 
Interpolation 


INTRODUCTION 


Scientists and engineers are often faced with the task of estimating the 
value of dependent variable y for an intermediate value of the indepen- 
dent variable x, given a table of discrete data points (x, yj), i = 0,1,..n. 
This task can be accomplished by constructing a function у(х) that will 
pass through the given set of points and then evaluating у(х) for the 
specified value of x. The process of construction of y(x) to fit a table of 
data points is called curve fitting. A table of data may belong to one of 
the following two categories: 

1. Table of values of well-defined functions: Examples of such tables 
are logarithmic tables, trigonometric tables, interest tables, steam 
tables, etc. 

2. Data tabulated from measurements made during an experiment: In 
such experiments, values of the dependent variable are recorded at 
various values of the independent variable. There are numerous 
examples of such experiments—the relationship between stress and 
strain on a metal strip, relationship between voltage applied and 
speed of a fan, relationship between time and temperature raise in 
heating a given volume of water, relationship between drag force 
and velocity of a falling body, еќс., can be tabulated by suitable 
experiments. 

In category 1, the table values are accurate because they are obtained 
from well-behaved functions. This is not the case in category 2 where 
the relationship between the variables is not well-defined. Accordingly, 
we have two approaches for fitting a curve to a given set of data points. 
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In the first case, the function is constructed such that it passos ugh _ 
all the data points. This method of constructing a function and esit 
value: led interpolation. The functio 
йге known as interpolation pre’vnomi 
* Та the second case, the values ae not accurate and, therefore, it will 
be meaningless to try to pass the curve through every point, The best 
strategy would be to construct a single curve that would represent the 
general trend of the data, without necessarily passing through the indi- 

. vidual points, Such functions are callec are called approximating functio ions, One 
popula ipproach for finding an an approximate { to fit a given sel - 
of experimental data is called least-squares regression. The approximat- 
ing functions are known as least-squares ‘polynomials, 

Figure 9.1 shows an approximate linear function and an interpolation 
polynomial for a set of data. Note that although the interpolation poly- 


Interpolation 
polynomial 


Linear approximating 
function 


Fig. 9.1 Curve fitting Іо a set of points 


nomial passes through all the points, the curve oscillates widely at the 
end and beyond the range of data. The linear approximating curve which 
does not pass through any of the points appears to represent the trend of 
data adequately. The straight line gives a much better idea of likely 
values beyond the table points. 

In this chapter, we discuss various methods of interpolation. They 
include: 

1. Lagrange interpolation 

2. Newton's interpolation 

8. Newton-Gregory forward interpolation 

4, Spline interpolation 

Before we discuss these methods, we introduce various forms af poly- 
nomials that are used in deriving interpolation functions. Least-squares 
regression techniques are discussed in the next chapter. 
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POLYNOMIAL FORMS 


The most common form of an nth order polynotnial id 


р ot OX tage t+ “| (9.1) 


This form, know as the power form, is very convenient for differentiat- 
ing and integrating the polynomial function and, therefore, are most 
widely used in mathematical analysis. However, there are situations 


where p form has.heen found inadequate, as illustrated by Example 


hi A 


/// Consider the power form of pix) for n = 1, 
Я Pix) = ау bay, 
Given that = 
p(100) = +3/7. 
(101) = – 4/7 „ 
obtain the linear polynomial p(x) using four-digit floating point arith- 


metic. Verify the polynomial by substituting back the values x = 100 and 
х = 101. 


p(100) = d; 100 a, = + 0.4286 
(101) ay + 101 a, =- 0.5714 


Then, we get ‘ 
а=-1 
@- 100.4 (only four significant digits) 
Therefore, T 
р(х) = 1004 - x 
using this polynomial, we obtain 
10100) = 04 


p101) = - 0.6 


Compare these results with the original values of p(100) and p(101). We 
have lost three decimal digits. 


Example 9.1 shows that the polynomials obtained. using the power form 
may not always produce accurate results. In order to overcome such 
problems, we have alternative forms of representing a polynomial. One 
of them is the shifted power form as shown below: 


PO) = + а, 6 - C eos -СЁ+ ra G- OF | (9.3) 
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where C is a point somewhere in the interval of interest. This form of 
represehtation significantly improves the accuracy of the polynomial 
evaluation. This is illustrat ' Ьу Fxample 9.2. 


ини  __ 


‘Repeat Example 9.1 using the shifted power form and fovr-digit arith- 


Shifted power form of first order p(x) is 
pix) = ау + a, (& - C) 
Let us choose the centre C as 100. Then 
р(х) = ay + a, (x - 100) 
This gives, 
p(100) = a, = 3/7 = 0.4286 
p(101) = 0.4286 + a, (101 — 100) = – 0.5714 
ау=-1 
Thus the linear polynomial becomes 
pix) = 0.4286 — (x — 100) 
Using this polynomial, we obtain 
p(100) = 0.4286 
p(101) = - 0.5714 
Note the improvement in the results. 


Note that Eq. (9.2) is the Taylor expansion of p(x) around the point C, 
when the coefficients a, are replaced by appropriate function derivatives, 
It can be easily verified that 


(ay 
a; =2 CO i0 1,2, an 
i! 


where p"XC) is the ith derivative of p(x) at С. 
There is a third form of p(x) known as Newton form. This is a 
gencralised shifted power form as shown below: 


р(х) = ау + а (x - C) + а; e — CO (x C) + ау (x - C) 
к - Су) к ~ Cy) +... + „б С) Œ- C)... G- Cy) 


(9.3) 
Note that Eq. (9.3) reduces to shifted pwer form when C, = C = Cy =... 
= C, and to simple power form when C, = 0 for all i. The Newton form 
plays an important role in the derivation of an interpolating polynomial 
as seen in Section 9.5. 
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Polynomials can also be expressed in the form 
ру (x) = by (x ~ ху) & — x3) 
+ by (x — xo) (x — x2) 
+ by (x — xq) к - xy) 


Tn general form, 


ET] LINEAR INTERPOLATION 


The simplest ea of interpolation is to approximate two data points by 


(9.4) 


a straight line{ Suppose we are given two points (x,, flx,)) and (ху, Alxg)). 
These two points can be connected linearly as shown in Fig. 9.2. Using 
the concept of similar triangles, we can show that 


Ро) Гр) Га) fix) 


x-x, X2 -X; 
d 
Ф, 100 Pa 
fi») 
d, 9 pi 
х, x x xx 


Fig. 9.2 Graphical representation of linear interpolation 


Solving for ftx), we get 


] 


fx4)- f(x,) 


ay 


fe) = flay) + (x - x) (9.5) 
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Equation (9.5) is known as linear interpolation formula. Note that the 
term 


f 


fx) 
“1 
represents the slope of the line. Further, note the similarity of equation 
(9.5) with the Newton form of polynomial of first-order. 
Сү=х, 
ao = о) 


fixa) - fix) 
= 


а = 


у } 1 
LIT he HE 


fix) i Т 14142 


= 
Determine the square root of 2.5) » 


їл | 2 | 22361 


The given value of 2.5 lies between the points 2 and 3. Therefore, 
x,22, Р) = 14142 
fx) = 1.7821 


Then 


1.7321 - 1.4142 


5) = (25-2. o I a 
f(2.5) = 1.4142 + (2.5 – 2.0) 30-20 


= 1.4142 + (0.5) (0.3179) 

5732 

The correct answer is 1.5811. The difference ig due to the use of a linear 
model to a nonlinear опе. _// \У Е 


Now, let us repeat the procedure assuming x, = 2 and x; = 4, 
fixi) = 1.4142 
fig) = 2.0 

Then, 

2.0- 1.4142 


2.5) = 1.414: -5 — 2.0) —— — 
fi 1.4142 + (2.5 - 2.0) 10-20 
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= 1.4142 + (0.5) (0.2929) 


= 1.5607 

Notice that th: error has increased from 0.0079 to 0.0204. In general, 
the smaller the inea] between the interpolating data points, the bet- 
ter will be the approxenation. 

The results could be improved considerably by using higher- -order 
interpolation polynomials. We shall demonstrate this in the next section. 


^ BB LAGRANGE INTERPOLATION POLYNOMIAL 


In this section, we derive a formula for the polynomial of degree n which 
takes-specified values at a given set of n + 1 points. 
wet xo, ху, ... x, denote n distinct real numbers and let fay fay f, be 
arbitrary real numbers. The points (xp, fo (ху, fi), «- (кь. fa) can be 
imagined to be data values connected by a curve. Any function p(x) 
satisfying the conditions 
pix=f, for k=0,1,. 
is called an interpolation function. An TASSE function is, there- 
fore, a curve that passes through the data points as pointed out in 
Section 9.1. 
Let us consider a second-order polynomial of the form 
pa (x) = bil - xo) (xx) $ 
+ balx- x, Y = x) 
+ balx - ху) (x = xo) (9.6) 
If (xo, fo), (x1, fy) and (ху, fy) are the three interpolating points, then we 
have 


рух) = fo = Бухо = ху) G9 = xa) 
PA) = he bir, - ху) ба - хо) 
рай) by jo — хо) (Xz — ху) 
Substituting for b}, b; and.b, in Eq. (9.6), we get 


(х-ху)}(х-х») 


Pat) = foe aa xp 


pp eras) 
1 xg) - x9) 


(х—х0)(х- х1) 
"hr xo 
Equation (9.7) may be represented as. 


ра) = fo lf) + fy L + fa lala) 


xj) (9.7) 
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2 
= Dili (x) 
i-o 
where 
з (х-х;) 
w= П = 
бин (® EJ 
In general, for +1 points we have nth degree polynomial as 

(9.8) 


P, lx) = SAL 
0 


where 
(9.9) 


Equation (9.8) is called the Lagrange interpolation polynomial. The poly- 

nomials /, (x) are known as Lagrange basis polynomials, Observe that 
1 fori-j 

i= fo fori# j 


Now, consider the case n = 1 
х-Х\ 


lolx) = 
ы 


Therefore, 


Рб) fix - xo) 
ч Xq-X1 
fi- fo 
hti a O 


ation formula 


This е linear inter 


Consider the problem in Example 9. 
ime second order 


ind the square root of 2.5 using 
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Let us consider the following three points: 

X972, x,-3, and  x,-4 
Then 

fo-14142, f,= 1.7821, апі  f,-2 
For x = 2.5, we have 


(2.5 ~ 3.0) (2.5 — 4.0) 


02.5) == m 
(2.0-3.0) (2.0 4.0) 
aet 


0.3750 


p) -25-20)(2.5-4.0) _ 
105) =F IET ar 


12.5) EG 0 


P2.5) = (1.4142) (0.3750) + (1.7321) (0.7500) + (2.0) (- 0.125) 
= 0.5303 + 1.2991 - 0.250 = 1.5794 


The error is 0.0017 which із much less than the error obtained in 
Example/9.3 


= – 0.125 


bi 
‘Find the Lagrange interpolation polynomial to fit the following data, 
a a 
oy 1 2 E] 
0° 1 XE [| 3 
E 137183 6.3891 | 190855 | > $c J 


Ца] to estimate the value of el, 


Lagrange basis polynomials are 


20 
pa- ED- — ^"^. 
(0-1) 0-H 0-3) WE ж. 
wy” 
155-612 +11х-6 ay? 
EM ы . 
: a? 
М-сы 0 
Sy Д 
-5x2 46x \y * 10055 


2 E 2M 


КАКЫ" 
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+ Aw 
Ly cO - be -9 hib ux 
TU. C-02-0(0-3) " 
_х%-4х®+3Зх 
“8 


(x -0)(x - 2) (x - 3) 


H9 = убуз 183—2) 


3х? +2х 


p(1,5) = 0.9375 
The interpolation polynomial is 
plx) = fe lolx) f, 10) + fa 150) + fa bg) 
"1.7183 (43 - 5x? +бх) 
Agee E EE 
{ 2 
6.3891(x? - 4x? « 3x) 
A ——————ÓÉ— 
n / 
/ 
` 19.0856 (x - 3x? +2x) / 
\ А 6 / 


? 4115987x 


5.07329 6.3621 


Г 6 
{ 0.8455х° — 1.0604 x? + 1.9331x 


(1.5) = 3.3677 
el^ = p(1.5) + 1 = 4.3677 


Points to be noted about Lagrange polynomial: 
1. It requires 2(n+1) multiplications/divisions and 2n+1 additions and 
subtractions 
2. If we want to add one more data point, we have to compute the 
polynomial from the beginning. It does not use the polynomial al- 
ready computed. That is, p,, ,x) does not use p,(x) which is already 
available 


Program LAGRAN 


Program LAGRAN computes the interpolation value at a specified point, 
given a set of data points, using the Lagrange interpolat.on polynomial 
representation. 
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*Main program 


* 


Г" 


10 


Functions invoked 


Constants used 


specified point, given a set of data points, using 
the Lagrange interpolation representation 


* 
This program computes the interpolation value at a * 
* 
* 


NIL 


NIL 


Variables used Е 
XN Number of data sets - 
X(T)- Data points = 
F(I)- Function values at data points » 
XP - Point at which interpolation is required  * 
FP - Interpolated value at XP " 
LF - Lagrangian factor. * 


MAX - Maximum number of data points permitted M 


INTEGER N,MAX 

REAL X,F,FP,LF,SUM 
PARAMETER(MAX = 10) 
DIMENSION X(MAX),F(MAX) 


WRITE(*, *) ‘Input number of data points(N)' 
READ(*,*) N 
WRITE(*, *) ‘Input data points X(I) and Function’, 


"values F(I)' 

WRITE(*,*) ‘one set in each line’ 
DO 10 I = LN 

READ(*,*) X(I), F(I) 
CONTINUE 
WRITE(*, *) "Input X value at which* 
WRITE (*,*) ‘interpolation is required’ 
READ(*,*) ХР 
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20 


30 


IP (I.NE.J) THEN 
LF = LF * (XP - X(J)) / (X(I) - X(J)) 
ENDIF 
CONTINUE 
SUM = SUM + LF * F(I) 
CONTINUE 
FP = SUM 


WRITE(*, *) 

WRITE(*,*) ‘LAGRANGIAN INTERPOLATION’ 
WRITE(*, *) 

WRITE(*,*) ‘Interpolated Function value’ 
WRITE(*,*) ‘at X = ', XP, * is', FP 
WRITE(*,*) 


STOP 


T-------------- End of main LAGRAN ----------------- + 


Tes! Run Results The program was used to compute the function value 
at x = 2.5 for the following table of data points: 


The results are shown below: 


x | 2 3 4 
T | 14142 17321 | 20 


Input number of data points(N) 
3 

Input data points Х(Т) and Function values F(T) 
one set in each line 

2 1.4142 

3 1.7321 

4 2.0 

Input X value at which 
interpolation is required 

2.5 

LAGRANGIAN INTERPOLATION 


Interpolated Function Value 
at X - 2.5000000 is 1.5794000 


Stop - Program terminated. 


ee 
ES NEWTON INTERPOLATION POLYNOMIAL 


We have seen that, in Lagrange interpolation, we cannot, use the work 
that has already been done if we want to incorperate another data point 
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in order to improve the accuracy of estimation. It is therefore necessary 
to look for some other form of representation to overcome this drawback. 

Let us now consider another form of polynomial known as Newton 
form which was discussed in Section 9.2. The Newton form of polyno- 
mial is 

р(х) = ag + aux - ху} + азіх – x9) (x - x) 
+... + apla = хо) (х— ху). - x.) (9.10) 
where the interpolation points xp, х1, ... x, act as centres. 

To construct the interpolation polynomial, we need to determine the 
coefficients ay, ay, ... а„. Let us assume that (xp, fo), (ху, fi), na» fra) 
are the interpolating points. That is, 

р(х) = Г, k=0,1,..n-1 
Now, at x = xy, we have (using Eq. (9.10)) 

paito) [ay = f] (9.11) 
Similarly, at x = x), 

ра) = Ug + ay, = xy =f 
Substituting for ay from Eq. (9.11), we get 


(9.12) 


At x = хо, 
рибка) = а + ах; — хо) + айлу - хо) (xa = x) = fa 


Substituting for a; and a, from Eqs. (9.11) and (9.12) and rearranging 
the terms, we get 


ge (fe - Mær - x0 - Is -foer - хо)! (9.18) 
X4-X9 


Let us define a notation 


ПЕД 


fe 

_ Дхь]- fixa] 

йай ы шш т 

flapa xg al fT te teed 


ы Ба. 


fix, xis Xe eal = 


ii Leona Ланы (9.14) 
T mE 


fixe xin 
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These quantities are called divided differences. Now we can. express the 
coefficients a; in terms of these divided differences. 
ав = fo = fl] 
fi-fo 


as 
21-х0 


fixo, х1] 


fh-h А-һ 
2-11 _Х{-%, 


а, 
Ы хз-5% 


Паха] хо, xi] 


ху-ху 


= flxo, x), ху] 
Thus, 
a, = fl, ху, t] (9.15) 


Note that a, represents the first divided difference and аз the second 
divided difference and во on. 
Substituting for а, coefficients in equation (9.10), we get 


Граб) = fixo] fixo, ху Ge — xo) + flag, ху, ху] (xe — x9) (a — EN 
_— +... 
* flxo, xy, 1 Xp] @- xo) (к- ху)... (x = x2) 
This can be written more compactly as 


| NN 
| раб) = Y flos] TI G- n) (9.16) 
ё іч 


Equation (9.16) is called Newton's divided difference interpolation poly- 
nomial. 


М 


Given below is a table of data for log x. Estimate log 25 using second 
order Newton interpolation polynomial. = Ы 


{ ofa | Ж ls 
" 1] oz | з [| 4 
log x; j 0 0.3010 0.4771 | 0.6021 

| 


Second. order polynomials require only three data points, We use the 
first three points 


“д 
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ag = Ах = 0 4^ 


Рол) fig 0.3010 

^Xiox, ^ 3-1 =03010 
Lee { 

|Ййхь.®»1- хө, zi] | 


%2-X% | 


са = Йо ху] = 


708 = ЇЇ» x. = 


_ 0.4771- 0.3010 
Е 3-2 ° 


=0.1761 


Therefore, 
_ 0.1761- 0.3010 
fr 3-1 
рух) = ay +аүх- Xo) + ax — xo) (x — х) 
0 + 0.3010 — 1) + (— 0.06245) (x- 0) (к - 2) 


[виа ) F 0.3010 » 1.5 – (0.06245) (1.5) (0.5) 
04518—00468— - E 


204047 (2 


Note that, in Example 9.6, had we used a linear polynomial, we would 
have obtained the result as follows: 


pix) = а, + ayx - xy) 
210.5) = 0 + 0.3010 (1.5) = 0.4515 


This shows that р (2.5) is obtained by simply adding a correction factor 
due to third data point. That is 


Рух) = рү(х) + ах — xo) (x — ху) 
= р(х) + Ay 


If we want to improve the results further, we can apply further correc- 
tion by adding another data point. That is 


Рзіх) = polx) + Ay 


= + 0.06245 


where 

Ag = аух - xy — x, Xx - x9) 
This shows that the Newton interpolation formula provides a very con- 
venient form for interpolation at an increasing number of interpolation 
points. Newton formula can be expressed recursively as follows: 


Pr AX) + fIxo, x4 lia GO (x4) (9.17) 


З 
where pala) = f[xo,...x;ló (x). Xa, 6;(x) 
<0 


and — dj) ex xy) Gc). a) ү) 
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I n DIVIDED DIFFERENCE TABLE 


We have seen that the coefficients of Newton divided difference interpo- 
lation polynomial are evaluated using the divided differences ni the 
interpolating points. We have also seen that a higher-order divided dif- 
ference is obtained using the lower-order differences. Finally, the first- 
order divided differences use the given interpolating points (i.e. x, and 
f, values). For example, consider the second-order divided difference 

аз = flap, ху, ху] 


_ flxy x0) - flo, 
Мы... 

where f[x,, x] and /1хо, x;] are first-order divided differences and are 

given by 


fis s] 0 0) iT. 


Xi-Xo х1-х0 
Ег 
S-21 


This shows that, given the interpolating points, we can obtain recur- 
sively a higher-order divided difference, starting from the first-order 
differences. While this can be conveniently implemented in a computer, 
we ean generate a divided difference tuble for manual computing. A 
divided difference table for five data points is shown in Fig. 9.3. A 
particular entry in the table is obtained as follows: 


lixe Xa X4] хохо. X31 


fps. Mos жу] = 


і x fix} First Third Fourth 
difference difference difference difference 


0 % ШЕ 
fixo xil 


t & fix} хх, Xal 
© B" E lc. Xy Xo. X] 
2 % пх AIX) Xo Xal fl. oss Xal 


Haa xd 
за dl ARA 


FX, X] 
4 бў— fear 


Fig. 9.3 Divided difference table 


Draw the two diagonals from the entry to be calculated through its 
neighbouring entries to the left. If these lines terminate at f(x;) and f(z;), 
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then divide the difference of the neighbouring entries by the correspond- 
ing difference x, — x;. The result is the desired entry. This is illustrated 
in Fig. 9.3. for the entry [x,, xy, ху, x. 

When the table is completed, the entries at the top of each column 
represent the divided difference coefficients. 


mum —  ..|. 


Given the following set of data points, obtain the table of divided differ- 
ences, U 2 the table [р наш the value of f(1.5). 


3 4 
4 5 
68 194 | 


The divided difference table is given below: 


i x | fee) | First | Second T Third | Fourth 
| difference | difference | difference | difference 


The value of polynomial at x + 15 is computed as follows: 
| dse С 7 А 
| BB = 0+ 71.5- 1) = 3.5 
| poll.5)= 3.5 + 12(1.5 - 1X1.5 - 2) = 0.5 
| p4(1.5) = 0.5 + 6 (1.5 – 101.5 – 2X1.5 - 3) = 2.25 
PALS) = 2.25 + 0 = 2.25 
The function value at x = 1.5 is 2.25 


Note that p,(1.5) = p,(1.5). This implies that correct results can be ob- 
tained using the third-order interpolation polynomial. It also illustrates 
that we can compute f(1.5) in stages (recursively) using interpolation 
polynomials in increasing order. Computation is terminated when two 
consecutive estimates are approximately equal or their difference is within 
а specified limit. 
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It is clear that the computational effort required in adding one more 
data point to the estimation process is very much reduced due to the 
recursive nature of computation. 

Let us have a close look at the divided difference table of Example 9.7, 
Notice the constant values under the column "third difference" and zero 
value under the column “fourth difference". Recall that the first divided 
difference is given by 


fx - fixo) 
Ripe 


flixo,xil- 


This is nothing but the finite divided difference approximation of the 
first derivative of the function. Similarly, fxg, ху, ху] is the second de- 
rivative and so on. Since the third derivative is constant, the function 
f(x) should be a third-degree polynomial. In fact, the function used in 
Example 9.7 is 

Гоа) =? -1 
and therefore 


and the fourth derivative is zero. 


Program NEWINT 


Program NEWINT constructs the Newton interpolation polynomial for a 
given set of data points and then computes the interpolation value at a 
specified value. 


села 16 

* n program 

* This program constructs the Newton interpolation 

*  polynonial r a given set of data points and then * 

*  computes terpolation value at a specified value * 
Й 


* Functions invoked ` 
* NIL * 
Ё азына OO =% 
* Subrou . 
ы NIL * 
* Variables used Ж 
* N - Number of data points * 
* X - Array of independent data points * 
* F - Array of function values * 


for interpolation 


* 


* ХР - Desired point 
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ЕР - Interpolation value at XP * 
D - Difference table ` 
A - Array of coefficients of interpolation . 

polynomial E 


жизн в 


NIL . 


INTEGER N 
RE, XP,FP,SUM, PI,X,F,A,D 
DIMENSION X(10),F(10),A(10),D(10,10) 


WRITE(*,*) ‘Input number of data points” 
READ(*,*) N 
WRITE(*,*) ‘Input the values of X and Р(х), ', 
* ‘one set on each line’ 
po 10 I= 1,N 
READ(*,*) X(I), F(I) 
10 CONTINUE 


* Construct difference table D 


DO 20 I = 1,N 
0(1,1) = F(T) 
20 CONTINUE 


ро 40 J = Z,N 
DO 30 I = 1, М№-7+1 
DULJ) = (D(I«1,7-1)-D(1,J-1))/ (X (47-1) -X(1)) 
30 CONTINUE 
40 CONTINUE 


* Set the coefficients of interpolating polynomial 


DO 50 J = 1,N 
A(J) = D(1,J) 
50 CONTINUE 


* Compute interpolation value 


WRITE(*,*) ‘Input XP where interpolation is 
required’ 
READ(*,*) XP 


SUM = A(1) 

po 70 I =2 
PI = 1.0 
DO 60 J = 1, 1-1 
PI = PI *(XP-XlJ)) 


iN 
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60 CONTINUE 
SUM = SUM + A(I) * PI 
70 CONTINUE 


* Write results 


WRITE(*,*) 
WRITE(*,*) ‘NEWTON INTERPOLATION’ 
WRITE(*,*) 

WRITE(*,*) ‘Interpolated Function value’ 
WRITE(*,*) ‘at X = ', XP, * is', FP 
WRITE(*,*) 

STOP 

END 
---- ---------- End of main NEWINT - 


Test Run Results Let us use the same table values that were used for 
testing the program LAGRAN., Test run results are given below: 


Input number of data points 
3 

Input the values of X and F(x), one set on each line 
2 1.4142 


Input XP where interpolation is required 
2.5 

NEWTON INTERPOLATION 

Interpolated Function Value 

at X = 2.5000000 is 1.5794000 


Stop - Program terminated. 


m i INTERPOLATION WITH EQUIDISTANT POINTS 


In this section, we consider a particular case where the function values 
are given at a sequence of equally spaced points, Most of the engineering 
and scientific tables are available in this form. We often use such tables 
to estimate the value at a non-tabular point. Let us assume that 
zxQ7xo kh 

where xp is the reference point and h is the step size. The integer & may 
take either positive or negative values depending on the position of the 
reference point in the table. We also assume that we are going to use 
simple differences rather than divided differences. For this purpose, we 
define the following: 
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The first forward difference Af; is defined as 


Mio fea f 
The second forward difference is defined as 
A fie Mir M 
In general, 
Аў = S fuo (9.18) 


We can now express the simple forward differences in terms of the di- 
vided differences, We know that 
fix - fixo) _ fy - fo 


ley) = ee Ат 
‘Therefore, 
fi = fo= h fleo х1] 

Then 

Moz ~ fo =h fles xi 
Similarly, 

М, = h fle xal 
Now, 

A foe Af = My 


= h fln х5] = h [хө xi] 
= h (fle, xy) - f Ixo x, 
=h. 2h . flo, xj, xal 
STA Fiati 
In general, by induction, 
N f s jt h f lxi ху, xul 
Therefore, 


A fo 
Flay xy x) = TA 


Substituting this in the Newton's divided difference interpolation poly- 
nomial (Eq. (9.16)) we get, 


| a Al fy 4-3 
| Palt)= Y j'hi e- (9.19) 
Let us set 
x-2x,*sh and p,(s) = p.a) 
We know that 
Xy Xy kh 
Thus we get 


x-x,-(s-kW 
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Substituting this in Eq. (9.19), we get 


л АР 1-1 
р.) = Y, —— [T (s- 9h 
кту! ары 


a Al fy 4 
= 2, Fras 1868 1..(s- j+ D] A/ 
Thus, 
(9.20) 


where 


is the binomial coefficient. Equations (9.19) and (9.20) are known as 
Gregory-Newton forward difference formula. 


Forward Difference Table 
The coefficients Af, can be conveniently obtained from the forward dif- 
ference table shown in Fig. 9.4. According to Eq. (9.18), each entry is 
merely the difference between the two diagonal entries immediately on 
its left. That is 

Af e fa n f 
The differences which appear on the top of each column correspond to 
the differences of equation (9.20). 


x f Af at ay | л АУ ГУД 


Xs D^ 5 [ 


Fig. 9.4 Forward difference table 
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As pointed out earlier, difference tables can be used not only to estimate 
the value of the function at a non-tabular point but can also be used to 
decide on the degree of the interpolating polynomial that is most appro- 
priate to the given data points. 


Estimate the value of sin баё A= 25° using the Newton-Gregory forward 
difference formula with the help of the following table. 


w | 2 тз [ 4 [ so 
0.1736 0.3420 0.5000 0.6428 0.7660 


T T T C 
al 4 à 
Ж "ume | = = 
[20 | 0343 - 
30 
0 
50 
Therefore, 


Using Eq. (9.20), we have 
pils) = 0.1736 + (1.5) (0.1684) = 0.4262 
(1.5) (0.5) (-0.0104) 


pos) = 0.4262 + 3 «04223 
pals) = 0.4223 ZENO SND 50.0048) = 0.4220 
pals) = 0.4220 + 119) (0.51 (70 
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Thus, 
sin 25 = 0.4220 


which is accurate to four decimal places. 


Backward Difference Table 


If the table is too long and if the required point is close to the end of the 
table, we can use another formula known as Newton-Gregory backward 
difference formula. Here, the reference point is x,, instead of x, There- 
fore, we have 

x=x,+sh 


Xp = Xp- kh 

x= x= (s + kh 

Then, the Newton-Gregory backward difference formula is given by 
ss 


p,(8) =f, +8 Vf, + i +... 


2! 


ae (9.21) 
al 

For a given table of data, the backward difference table will be identical 
to the forward difference table. However, the reference point will be 
below the point for which the estimate is required. This implies that the 
value of s will be negative for backward interpolation. The coefficients 


Vj f; can be obtained from the backward difference table shown in Fig. 9.5. 
x f E. Nn J v ki vit уч | ver 
E 


ЕЖ 
| gy in = + " 


| ve i 


| 4 
| v& Уз, 
ЕШ | 


САЛ 


| 
Fig. 9.5 Backward difference table 
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Repeat the estimation of sin 25 in Example 9.8 using Newton's back- 
ward difference formula 


QUT) _ 25- 
a 3 
Using Eq. (9.21), we get — 
p((2.5) = 0.7660 + (—2.5) (0.1232) (3 
C2 C L8) (0.0196) 
3 
C348 C 1.8) C-0.5) (0.0044) 
6 
42:0 C18 0.5) (0.5) (-0.0004) 
сарЫ СБ) (ЧИК) 
94 


-2.5 


= 0.4200 


E87] SPLINE INTERPOLATION 


So far we have discussed how an interpolation polynomial of degree n 
can be constructed and used given a set of values of functions. There are 
situations in which this approach is likely to face problems and produce 
incorrect estimates. This is because the interpolation takes a global rather 
than a local view of data. It has been proved that when n is large 
compared to the order of the “true” function, the interpolation polyno- 
mial of degree n does not provide accurate results at the ends of the 
range. This is illustrated in Fig. 9.6. Note that the interpolation polyno- 
mial contains undesirable maxima and minima between the data points. 
This only shows that increasing the order of polynomials does not neces- 
sarily increase the accuracy. 


Fig. 9.6 Interpolation polynomial ої degree 11 of the function 1 


х2 
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One approach to overcome this problem is to divide the entire rang 
of points into subintervals and use local low-order polynomials to inter 
polate each subinterval. Such polynomials are called piecewise polyno 
mials. Subintervals are usually taken as [x,, x;,,], i = 0, 1, ... п as illus 
trated in Fig. 9.7. 


Piecewise polynomials 


—À ^V 


Xa x ы 
Fig. 9.7 Piecewise polynomial interpolation 


Ха 


Notice that the piecewise polynomials shown in Fig. 9.7 exhibit dis- 
continuity at the interpolating points (which connect these polynomials), 
It is possible to construct piecewise polynomials that prevent such 
discontinuities at the connecting points. Such piecewise polynomials are 
called spline functions (or simply splines), Spline functions, therefore, 
look smooth at the connecting points as shown in Fig. 9.8. The connect- 
ing points are called knots or nodes (because this is where the polynomi- 
al pieces are tied together). 


ax? + Ox +6, 8, X + Di AX + С, 
fix) 
де ны, 1864 
УЙ b A, 
Se F(X, 2) 
X. х хл Maz 


Fig. 9.8 Secona degree spline polynomials 
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A spline function s(x) of degree m must satisfy the following conditions: 

1, s(x) is a polynomial of degree atmost m in each of the subintervals 

bx, xul E20, 1n 

2. s(x) and its derivatives of orders 1,2 ... m — 1 are continuous in the 

range [x,, x,l. 

According to the first condition, each interval will have a different 
polynomial of degree m or less. The set of all polynomials form a spline 
interpolation polynomial, if (x,) = f,, for i = 0, 1, ... n. The process of 
constructing such polynomials for a given set of function points is known 
as spline interpolation. 


re 


State whether the following piecewise polynomials are splines or not, 


х+1 -15х50 
(i) f(x) =42х+1 0<х<1 
4-х 1<$х<2 


x?41 05х51 
(ii) fe) = 12x? 1<х<2 
5х-2 2<х<3 


n=4, xo =-1 X;-0, җ=1, x22 


físx*1 
fix) = 2x +1 
file) =4—x 
Then, 
fy)=04+1=1 
fale) = 2x 04151 
fíz)-24148 
fíx)-4-1-3 
Note that 


Һа) = а) and р) = fate) 
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Therefore, the piecewise polynomials are continuous and fix) is а linear 
spline. Note that the first-derivative is not continuous and, therefore, 
f(x) is not a second-degree spline. 


Case (ii) 

Given, 

n=4, =, а= xy,=3 
Ло) = +1, fitx) = 2x 
fila) = 212, fix) = 4x 
Во) = Bx - 2, fala) = 5 

Then, 
fie) =1+1=2, fix) =2 
МБ) =2х1=2, К) =4 
Ро) = 2 х4 = В, Ро) = 8 
Міх) = 552-2 =8, О) = 5 


Polynomials are continuous but their derivatives аге not. Therefore, f(x) 
is not a spline. 


Case (iii) 
Given, 
n=4, ху= 0, ху=1, х)=2, xg-23 
fix, ош) 2 1, ftx 
fíx)-x!-x1, [у(х)=?х-1, h'a) = 
Рх) = Bx - 3, ful) = 3, fiin = 0 
Then, 
fix) = 1 fix) = 1 
fx) = 1 Во) = 1 
fix) = 3 fi = 3 
fx: = 3 fi) = 3 


Since both the polynomials and their first derivatives are continuous in 
the given interval, f(x) is a second-degree spline. Note that the second 
derivatives are not continuous. 


Cubic Splines 


The concept of splines originated from the mechanical drafting tool called 
“spline” used by designers for drawing smooth curves. It is a slender 
flexible bar made of wood or some other elastic material. These curves 
resemble cubic curves and hence the name “cubic spline” has been given 
to the piecewise cubic interpolating polynomials. Cubic splines are popu- 
lar because of their ability to interpolate data with smooth curves. It is 
believed that a cubic polynomial spline always appears smooth to the 
eyes. 
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We consider here the construction of a cubic spline function which 
would interpolate the points (xo, fo), (x1, fj), ~.. (х„, fn). The cubic spline 
s(x) consists of (n — 1) cubics corresponding to (n — 1) subintervals. If we 
denote such cubic by зх), then 

s(x) = 5x), = 1,2, 

As pointed out earlier, these cubics must satisfy the following condi- 

tions: 


1, s(x) must interpolate f at all the points xo, х, ... x, Le, for each i 


8) =f; (9.22) 
2. The function values must be equal at all the interior knots 
зй) = 610) (9.23) 
3. The first cerivatives at the interior knots must be equal 
8;'(x,) = si. Gà) (9.24) 
4. The second derivatives at the interior knots must be equal 
si'le) = sj," Gc) (9.25) 


5. The second derivative at the end points are zero 
s(x) = s") = 0 

Step 1 
Let us first consider the second derivatives. Since s; (x) is a cubic 
‘unction, its second derivative s(x) is a straight line. This straight 
ine can be represented by a first-order Lagrange interpolating 
»olynomial. Since the line passes through the points (x;, s;'"(x;)) 
ind (x, 4, 5/(,.,)), we have, 

z- 


в) e s) eara) 


ii 
5-1 


(9.26) 


à 


the unknowns s;"(x;;) and s;"(x,) are to be determined. For the sake of 
implicity, let us denote 


sesa and 
x- x= Uy; 
Xis Xj кещ. Mj 
hen, Eq. (9.26) becomes 
" 
G6) аа ug, 


=h, 


i hi 
Bi - 2, Uu; 
р аы (9.27) 
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Step 2 
Now we can obtain 5,(х) by integrating Eq. (9.27) twice. Thus 


аш? 


s(x) = + Сух + С (9.28) 


6h 


d 


where C, and C, are constants of integration [observe that du/d, = 1 
and, therefore, differentiation and integration with respect. to x and with 
respect to ш; will be equivalent]. The linear part Сух + C, can be ex- 
pressed as 

by(x-x + by (x x) 
with suitable choice of b, and bp. 
Therefore, 

C x + Cy = by (x -x, р) + by x- x) 
= by uj + bo uy 


Then, Eq. (9.28) becomes, 


з 
аш} 


ва) = ——— 


Step 3 
Now, we must determine the coefficients b, and b. We know that, by 
condition 1, 


з(х)=/; апі s&h 
Atrzr, 


Similarly, at x = x, 
Uii 
and therefore 


(9.29a) 


(9.29b) 
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Substituting for 6, and b, in Fq. (9.29) and after rearrangement of 
terms, we get 


а 
а) = 


6h; 


+ (ui i fiu) (9.30) 


Note that Eq. (9.30) has only two unknowns, а, ; and a;. 


Step 4 
The final step is to evaluate these constants. This can be done by invok- 
ing the condition 


sia) = sí ухо 
Differentiating Ед. (9.30) we get 


ayy 4 D, 
вү(х) - —— th," - Зи 
6h 


^ Tw Gu А 


Setting x = x,, 


sx) 
Similarly, 
state) = 
Since 
s/a) 7 1.35) 
We have 


һа, +20 + да а = (9.31) 


“1 i 


Equation (9.31), when written for all interior knots (i = 1, ... л - 1), we 
get n – 1 simultaneous equations containing n + 1 unknowns (ap, ау, ... 
a,). Now, applying the condition that the second derivatives at the end 
points are zero, we get 


a,=0,=0 


306 Numerical Methods 


Thus, we have л — 1 equations with л ~ 1 unknowns which can be 
easily solved. 


Note 
The cubic splines with zero second derivatives at the end points are 
called the natural cubic splines. This is because the splines are 
assumed to take their natural straight line shape outside the inter- 
vals of approximations. 


The system of n — 1 equations contained in Eq. (9.31) can be expressed 
as 


Ahy+hy) hy о. 0 0 0 
hy hath) hg - 0 0 0 
0 o D + Aa ak. f 
0 D] о 0 lisi 2(һ„-у+!һ„) 
e D, 
а р, 
“| :.| (9.32) 
985-1] Da-i 
where 


Given the data points 


TELDE 2 
E 4 9 16 
E 2 TEF 


һ=-ху=16-9=7 
h-2, fh-8 h=4 


Curve Fitting: Interpolation 307 


From Eq. (9.31), we have, for i = 1, 


r S 
hiag + 2h, + h)a, + йа; =6| f = h а 

Lo 1 
We know that ag = a; = 0. Thus, 
p aN | 

25 + Ta, -e[1-2] 
"Therefore, 
(6)(-2) 
а = 735) (24) = – 0.0143 


Since л = 3, there are two cubic splines, namely, 
s(x) xy SxSay, 
Six) xy SX SX 
The target point x = 7 is in the domain of s(x) and, therefore, we need to 
use only s,(x) for estimation. 
From Eq. (9.30) 
a, (u - hi uo) 
6h, 


upsx-xo and  ujex-i 


s(x) = x (шо - foui) 
hy 
Upon substitution of specific values, 


„00143 i4. 48 gig 
s 0-5 [07 -4*-5* 0-4 


-4lm0-4-20-91 
= 2.6229 


Algorithm 

Note that Eq. (9.32) form a tridiagonal system which is relatively simple 
to solve using Gauss elimination method. A detailed solution procedure 
to evaluate spline functions is given in Algorithm 9.1 


Natural cubic spline 


. Provide input data. 
Compute step lengths and form function differences. 
Obtain the coefficients of the tridiagonal matrix. 

Compute the right-hand side (D array) of the system. 

. Compute the elements a, using Gauss elimination method 


prep> 


(Contd.) 
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(Contd.) 


6. Evaluate the coefficients of natural cubic splines 
7. Evaluate the spline function at the point of interest. 
8. Print results. 


Program SPLINE 


Natural cubic splines interpolation uses Gauss elimination method to 
implement its algorithm. Program SPLINE, therefore, calls for the help 
of GAUSS subprogram to compute the array of second derivatives. 


7 Тет Ре? — — 
PROGRAM SPLINE 
"ү a al — йб 
* Main program * 
si This program computes the interpolation value ar * 
$ a specified value, given а set of table points, * 
* using the natural cubic spline interpolation * 
Sb ities UP a 2 s EAE 
* Functions invoked * 
E — o— = — "жашы 
* Subroutines used x 
* GAUSS * 
$ iables used * 
+ М - Number of data points. s 
* X - N by 1 array of data points. * 
* F - N by 1 array of function values * 
" XP - Point at which interpolation is required  * 
á Interpolation value at XP * 
. А Array of second derivatives (М-2 by 1) + 
* D Array representing right side of (9.32) * 
» (N-2 by 1) " 
* C - Matrix (N-2 by N-2) representing the * 
* coefficients of second derivatives * 
> Н - Array of distances between data points * 
kd (hti) = x(i) xti-1)) е 
* DF Array of differences of functions * 


- MAX - Maximum number of table points permitred 
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INTEGER N,MAX 

REAL XP,FP,X,F,A,D,C,H,DP,U 

PARAMETER (MAX-10) 

DIMENSION X(MAX),FIMAX) , A(MAX) ,D(MAX) ,C (MAX, MAX) , 
+ H(MAX) , DF (МАХ), U (MAX) 


* Read input data 


WRTTE(*,*) ‘Input number of data points n’ 
READ(*,*) N 
WRITE(*,*) ‘Input data points X(I) and function’ 
WRITE(*,*) ‘values F(I), one set in each line’ 
bo 5 I = 1,N 

READ(*,*) X(I), F(T) 
CONTINUE 


а 


WRITE(*,*) ‘Input XP’ 
READ(*,*) XP 
* Compute distances between data points 
* and function differences 
DO 10 I = 2.N 
HU) = X(I) = X(I-1) 
DF(I) = F(I) - Е(1-1) 
10 CONTINUE 


* Initialise C matrix 


DO 30 І = 2,N-1 


ро 20 J = 2, N-1 
C(I,J) = 0.0 
20 CONTINUE 


30 CONTINUE 
* Compute diagonal elements of C 


DO 40 I - 2,N-1 
С(т,1) = 2.0 * (H(I)+H(I+1)) 
40 CONTINUE 


* Compute off diagonal elements of C 
DO 50 I = 3,N-1 
C(I-1,I) = HT) 
С(1,1-1) = H(I) 
50 CONTINUE 


* Compute elements of D array 


00 60 I = 2,N-1 
D(I) = (DF(I+1)/H(I+1) - DF(I)/H(1)) * 6.0 


310 Numerical Methods 


60 CONTINUE 


* Compute elements of A using Gaussian elimination 
* Change array subscripts from 2 to п-1 to 1 to п-1 
* before calling GAUSS 


M = N-2 
DO 80 I = LM 
D(I) = D(T+1) 
DO 70 J = 1,M 
C(I,J) = С(1+1,7+1) 
70 CONTINUE 
80 CONTINUE 


CALL GAUSS(M,C,D,A) 
* Compute the coefficients of natural cubic spline 


po 90 I = N-1,2,-1 
A(T) = A(T-1) 
90 CONTINUE 


A(1) = 0.0 
A(N) = 0.0 

* Locate the domain of XP 
т=2 

100 IF( XP .LE. X(I) ) GO TO 110 
I= 11 
GO TO 100 


* Compute interpolation value at ХР 
* Use equation (9.30) 


110 U(I-1) = XP - X(I-1) 
U(I) = XP - X(I) 
Ql = H(I)**2 * U(I) - U(I)**3 
Q2 U(r-1)**3 - H(x)**2 * U(I-1) 
Q3 = F(I) * U(I-1) - P(I-1) * U(I) 
ЕР = (A(I-1) * Ql + A(I) * Q2)/(6.0 * H(I)) 
* Q3/H(I) 
* Write results 
WRITE(*, +) 
WRITE(*,*) 'SPLINE INTERPOLATION' 
WRITE(*,*) 
WRITE(*,*) ‘Interpolation value =',FP 
WRITE(*, *) 


STOP 


- End of main SPLINE- 
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SUBROUTINE GAUSS(N 


,A,B 


* 

* Subroutine 
$ This subroutine solves a set of п linear 
* 


* Arguments 

* Input 

ы м - Number of equaLions 

" A - Matrix of coefficients 
* B - Right side vector 

* Output 

^ X - Solution vector 

* pocal Variables 

» PIVOT, FACTOR, SUM 
. 

^ 

^ 


Functions invoked 
NIL 


* subroutines called 
* NIL 


INTEGER N 
REAL A,B,X, PIVOT, FACTOR, SUM 
DIMENSION A(10,10), BI10), X(10) 


* ----— = ---- Elimination begins - 


DO 331 K = 1, N-1 
PIVOT = A(K,K) 
DO 22 I = Krl, N 
FACTOR = A(I,K)/PIVOT 
ро 11 J = Kd, N 
A(i,J) = A(I,J) - FACTOR * A(K,J) 


11 CONTINUE 
B(I) = Б(І) FACTOR * B(K) 
22 CONTINUE 


33 CONTINUE 


X(N) = BIN) /AIN,N) 
Do 55 K 


-----------.- Baek substitution begins---------7---- 


өы» еа > & 


атка» 


* 
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SUM = SUM + A(K,J) * X(J) 


44 CONTINUE 
X(O = (B(K) - SUM)/A(,X) 
55 CONTINUE 
RETURN 
END 
* oL End of subroutine GAUSS---- ------ + 


Test Run Results Program SPLINE was tested using the table of data 
points given in Example 9.11. 


Results are given below: 
Input number of data points n 
3 
Input data points, X(1) and function 
values F(1), one set in each line 
42 


16 4 
Input XP 
7 


SPLINE INTERPOLATION 
Interpolation Value -, 2.6228570 


Stop - Program ters 


Equidistant Knots 
Most often the knots are equally spaced, This would simplify the solu- 
tion considerably. If the knots are equally spaced, 
hy =hy=...=h, =h, 
Substituting this in equations (9.11) and dividing throughout by h, we get 


4 1 

1 

0 1 

і (9.33) 
0 

0 0 


where 


D, 6.,-2f fi) 
TR — 
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"TT 
= блед 


= 12 fly д Moa 


Шш 


Given the table of values 


| | EON CE NE 

| i 0 К 1 2 3 

x 1 $ 3 4 
fix) | 05 [ 0.3333] 025] 030 


estimate the value of /(2.5) using cubic spline functions 


"The points are equally spaced and therefore 
hy =hg =hyg=1 


Since a = 4, we have three intervals and three cubics and, therefore, 
only a, and ay are to be determined. From Eq. (9.33), we have 


li Qna]. [à 
l1 4|[а | |4, 
T 
itas fe A +h) 
= 6 (0.25 - 2 х 0.3333 + 0.5) 
= 0.5004 
dy =F fs - a +f) 


= 6(0.2 — 2 x 0.25 + 0.3333) 
= 0.1998 
Solving for a, and аз 
d,x4-d,x1 
а= = = 


= 9.5004 0. 1998 =0.1201 


Ё 
Í The target point х = 2.5 is in the domain of s,(x). Using Eq. (9.80), 


ade) = Fela - ug) + Stu! ш) + = fiti) 


i = Biz) - 6x] Ben” 
| а + Дх x) = fi — xol 


‚ Upon substitution of values, we get 
{ 3,2,5) 02201025 -3)-(55- 39] 


! +00199 025 = 2) -(2.5- 2) 


} + (0.25) (2.5 — 2) — 0.3333 (2.5 - 3) 
= — 0.0075 — 0.0012 + 0.125 + 0.1667 
= 0.2829 


es CHEBYSHEV INTERPOLATION POLYNOMIAL 


Recall that the truncation error in approximating a function f(x) by ah 
interpolating polynomial p,(x) with interpolation points x;, i = 0,1,...n is 
(л #1)! 


Го) — р(х) =ш„(х) 


where 

w,(x) = (x — ху) x — ху). (x —x,) 
and @ is some point in the interval of interest. One of the goals while 
applying an interpolation polynomial is to minimise the truncation er- 
ror. Since f“ * 00) is not in our control, we can try to minimise the 
absolute value of w,(x). This ean be done by choosing a proper set of 
interpolating points x; in the given interval (a, b). 


Chebyshev Points 


The Russian mathematician Chebyshev showed that the error bound is 
minimum when the interpolation points are chosen as follows: 


a+b a-b 2k+1 
act yet е, 3 *| k=0,1,..,n (9.34) 
These values are called Chebyshev nodes (or points). We can evaluate 
function values at these points. That is 

f= fen) 
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Now, we can apply the Lagrange interpolation method to the Chebyshev 
points and the corresponding function values to obtain an interpolation 
уо Кпозуп as Lagrange - Chebyshev interpolation polynomial. 


Chebyshev Polynomials 
Another approach to construct the interpolation polynomial p,(x) is to 
use Chebyshev polynomials as basis polynomials. That is 

рх) = Cot + CyT y(t) +... + CT) 


Š Ўст W (9.35) 


where T(t) is the Chebyshev basis polynomial of order i in ¢ and C, the 
Chebyshev coefficient, Equation (9.35) is known as Chebyshev interpola- 
tion polynomial. Chebyshev polynomial Т?) is given by 


Туй = 1 
Түй =! 
(0) = 2t T, _ (0 — T, 4) k-22,..n 
C, are computed as follows: 


12 
Co= эл 3. FG) To (ty) = "EE 2 Fx) 
; > ти 
67311 fo? AUS) 
where 
" (2k « 0л] 
тш) =н) 3nD | 
"Therefore 


2 2 Okt De] _ 
TE FD je12..,n 


Evaluation of p, (x), given x: 


x-(b*ay2 


t=- Bae 


pe) = È CT, 
EI 


poo; SUMMARY 


In this chapter, we discussed various methods for constructing interpo- 
lation polynomials for tables of well-defined functions. They include: 
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* Lagrange interpolation 

* Newton's interpolation 

* Newton-Gregory forward interpolation 

* Spline interpolation 

To facilitate the construction of interpolation functions, we presented 
different forms of polynomials that included 

* power form 

+ shifted power form 

* Newton form 

We have also discussed how to build different types of difference 
tables and how to use them for estimating function values at any point. 
Finally, we considered how Chebyshev points and Chebyshev polynomi- 
als may be used to minimise the truncation error. 

We have given computer programs and test results for the following 
methods: 

* Lagrange interpolation 

• Newton's interpolation 

* Spline interpolation 


[pem] 


Approximating functions 


Leat-squares polynomials 


Backward difference Leat-squares regression 
Central cubic spline Linear interpolation 

Central difference Natural cubic spline 
Chebyshev basis polynomial Newton form 

Chebyshev interpolation Newton interpolation polynomial 
Chebyshev points Newton's interpolation 
Chebyshev polynomial Newton-Gregory formula 
Cubic spline Newton-Gregory interpolation 
Curve fitting Nodes 

Divided difference table Piecewise polynomial 

Divided differences Power form 

Forward difference Shifted power form 
Interpolation Simple difference 

Interpolation function Spline function 4 
Interpolation polynomial Spline interpolation 

Knots Spline interpolation polynomial 
Lagrange basis polynomial Splines 

Lagrange interpolation Taylor expansion 

Lagrange interpolation polynomial ^ Tridiagonal system 
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Ж. What is curve fitting? West is the need for such an exercise? 

A. What is interpolation? 7 

8. What are the methods available for interpolation? 

A. Discuss the possible sources of errors in interpolation? 

‘B. What is interpolation function? 

4. List, with examples, different forms of polynomials that could be 

used for constructing interpolation functions. 

f. Given two points (о, у) and (xy, уз), state the linear interpolation 

formula in terms of these pointe Ы 

„©. Given a set of n + 1 points, state the general form of nth degree 
Lagrange interpolation polynomial. 

9. What is the computational effort required in using Lagrange poly- 

nomial? 


us is the major re of using Lay olyn 
ei 


-What are divided differences? 
‚ State the second order Newton’s divided difference interpolation 


‘polynomial 
. How is the Newton's interpolation formula better than Lagrange 
Mf formula? à 
14, What is a divided difference table? How is it useful? 
JE Cou a divided difference table for four data points. 
16. Entries under a particular column in a divided difference table are 
constants. What does it indicate? 
17/Distinguish between the simple difference and divided difference. 
КА What is the difference between the forward difference table and 
backward difference table? 
19. Look at Examples 9.8 and 9.9. Answers are different. Why? 
20. What are piecewise polynomials? 
21. What are spline functions? 
29. What is spline interpolation? 
23. What are cubic splines? 
24. State the conditions for a spline to be cubic. 
25. What are natural cubic splines? 
26. What is tridiagonal system? 
27. State the contribution of Russian mathematician Chebyshev in mini- 
mizing the truncation error in interpolation. 


g 


H 
1/Cotistruct the power form of the straight line p(x) which takes on 

„Л values 

Va p(200) = 1/3 


2 
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p(202) = - 2/3 
using four-digit floating-point arithmetic. 
2. Solve the problem in Exercise 1 using the shifted-power form and 
compare the results, . 
3. Find the linear interpolation polynomial for each of the following 
pairs of points: 
(a) (0, 1) and (1, 3) 
(b) (-2, 3) and (7, 12) 
4. Find the quadratic interpolating polynomial for each of the follow- 
ing set of points: 
(a) (-1, 1), (0, 1) and (1, 3) 
(b) (0, 1), (1, 0) and (2, 9) 
5, "l'able below gives values of square of integers: 


id alils ri [415] 
E 1 4 9 | 16 | 25 | 


(b) using the points 2 and 4 
Compare and comment on the results. 

E sing the data in Exercise 5, estimate the square of 3.25 using the 
second-order Lagrange formula. Compare the error with the errors 
obtained in Exercise 5. 

7, When the value of x at which we wish to estimate the value of fix), 
ø lies outside the given range, we call it extrapolation. Use the 
* Lagrange formula to find the quadratic equation that takes the 

following values: 


— T 
x 1. | 2 
folaila 


ind fix) atx = 0 and x - 4 
8. Given the points below, obtain a cubic polynomial using the 
Lagrange formula: 


x | 0 1 | 2 a 
ra | 11а [а [0 | 
$i Find the Lagrange interpolation polynomial which agrees with the 
following data: 
r] TU IL TI | 12 | 
cosx | 0.5403 | 0.4536 | 0.3624 
Use it to estimale cos 1.15 


10. 


16. 


bw 


18. 


39. 
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Find the polynomial of degree three to fit the following points: 


T Т 
jaj 0 


[з | 


fix) | —6 
| 1 


2 | 


1 
2 


10 


. Show that when л = 2, Lagrangian interpolation formula reduces 


to the linear interpolation formula. 


. The Lagrange interpolation polynomial can be derived directly from 


Newton’s interpolating polynomial. Prove this using the linear case. 
Fit a second-order Newton's interpolating polynomial to estimate 
cos 1.16 using the data from Exercise 9. 


. Fit a third-order Newton's interpolating polynomial to estimate cos 


1,15 using the data from Exercise 9 along with the additional point 
cos 1.8 = 0.2675. 


. Given the data 


Dx | m | s j ta | 18 | 
[fe | 1%% Trost | i119 | 114 | 
(a) Calculate (1.35) using Newton's interpolating polynomial of 
order 1 through 3. Choose base points to attain good accu- 
racy. 
(b) Comment on the accuracy of results on the order of polyno- 
mial. 
Find the divided differences fixo, х1), fle, ху] and хо, xy, ху] for 
the data given below. 
Я а! : 
xj К 
fo | 5 5 | 
Also find the divided differences /|xp, ху] and fixo, x2, x,]. Compare 
the resulta f[xo, Ху, х2] and f[xo, ху, х1]. 
Estimate the value of In (3.5) using Newton-Gregory forward dif- 
ference formula given the following data: 


[s 
[m 
Repeat Exercise 17 using Newton's backward difference formula. 


Compare the accuracy of results. 
Construct difference tables for the following data: 


= [oa | 
f) | 0.003 


Find f(0.6) using a cube that fits at x = 0.3, 0.5, 0.7 and 0.9. 


т 
io | 20 | 30 | 40 | 
0.0 | 06931 | 10986 | 1.3863 | 


13 
0.697 


оз | os | от] оэ | 11 
$087 | 0.148 | 0:248 |0370 [0.518 
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20. What is the minimum degree of polynomial that will exactly fit all 
seven pairs of data in Exercise 19. 

21. Construct a divided difference table for the data in Exercise 19. 
How do the values compare with those in the table obtained in 
Exercise 19. 

22. State whether the following functions are Splines or not. 


x 0<х<1 
2 
(а) fi 212713 leres 


5x-8 З<х<4 


х2-3х+1 0<х<1 
(b) fx) - 1x3 &x?-3. 15х50 
х%+5х-9 2«xy«3 


-x+5.5 3.0<х<4.5 
(с) fix) = 0.64x? ~6,76x + 18.46 4.55хх7.0 
-L6x* + 24.6х – 91.3 7.0<2<9.0 
28. Find the values of a and 6 such that the function 
afa? -х+1 1сх<2 

FOS] sk&-& св 


is a quadratic spline. 
24. Fit quadratic splines to the data given below: 


5 l 


Predict /(2.5). 
25, Develop cubic splines for the data given below and predict f(1.5) 


E A oļi 2 E 
Го) % | = -1 0 
26. Given the data points 
E 0 sol E 3 
x | 10 | 80 4.0 70 
ЕС 15 45 | 90 Jr 255 


Estimate the function value at x = 1.5 using cubic splines, 


27. 


28, 
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The velocity distribution of a fluid near a flat surface 15 given 
below: 


v 


x Гы оз [os [oz Гов 


re a 
972 | 181 |273 [347 | зэв 


x is the distance from the surface (cm) and v ів the velocity (em/ 
sec). Using a suitable interpolation formula obtain the velocity at 
X = 0.2, 0.4, 0.6 and 0.8. 

The steady-state heat-flow equation /(x,y) is solved numerically 
and temperature values obtained at the pivotal points of a grid in 
the domain of interest are tabulated below. (This type of problems 
are discussed in Chapter 15). 

Table of f(x, y) 


у 05 | 10 | 15 | ii 


x 
03 | 150 | 210 +E 31.0 


| 10 20.0 20.0 | 200 
| 15 25.5 19.0 | 90 
[20 | 309 | 200 | 106 [T wo 


Solution of heat-flow equations by numerical methods gives infor- 
mation only at the nodes and not at the intermediate points. We 
are interested in the temperature at the point (1.25, 1.25). Esti- 
mate this value using the data available in the table. 


‚ Write subprograms 


(a) COSPLN to compute the coefficients cubic splines, and 
(b) VSPLN to evaluate the spline function at the specified point. 


. Write an interactive main program that will read the given set of 


table points and the point of interest, estimate the interpolation at 
the specified point using the subprograms COSPLN and VSPLN 
developed in Project 1, and then print the results. 


- Write а program to evaluate forward differences and print a for- 


ward difference table for a set of n function values, 


- Following is a table that lists values of cube roots of numbers from 


1.0 to 2.0 in steps of 0.1. 


Р 


Ye | 


z же 1 j] | 
| 10 CrO 
| Жыш! | 


| | ge 
| 10 1 | te] ia] 14 э m ERR: 20 


irm [umo une iid E 
і н. 
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Write a program for linear interpolation of this table of data and 
produce another table of cube roots for numbers 1.25 to 1.75 in 
steps of 0.05 shown as follows: 


L175 


5. Modify the program in Project 4 to produce the following table: 


| x Interpolated cube True value of Yx Error 
root of x Е 


30 | 


6. Using a table of cosines, accurate to four digits, write a program to 
implement the following tasks: 
(a) Read the cosine of 0°, 10°, . . . 90° 
(b) Compute the cosine of angle for any value between 0° and 
90° using linear interpolation. 
(c) Compare the results of (b) with the output of intrinsic cos 
function. 
7. Write a program to estimate a value f(x, y) from a given table of 
values of x and у by interpolation. 
Test your program by solving the problem in Exercise 28. 


Curve Fitting: 
Regression 


ЁЛ INTRODUCTION 


In the previous chapter we discussed various methods of curve fitting for 
data points of well-defined functions. In this chapter, we will discuss 
methods of curve fitting for experimental data. 

In many applications, it often becomes necessary to establish a math- 
ematical relationship between experimental values. This relationship 
may be used for either testing existing mathematical models or estab- 
lishing new ones. The mathematical equation can also be used to predict 
or forecast values of the dependent variable. For example, we would like 
to know the maintenance cost of an equipment (or a vehicle) as a func- 
tion of age (or mileage) or the relationship between the literacy level and 
population growth. The process of establishing such relationships in the 
form of a mathematical equation is known as regression analysis or 
curve fitting. 

Suppose the values of y for the different values of x are given. If we 
want to know the effect of x on y, then we may write a functional 
relationship 

у=) 
The variable у is called the dependent variable and x the independent 
variable. The relationship may be either linear or nonlinear as shown in 
Fig. 10.1. The type of relationship to be used should be decided by the 
«periment based on the nature of scatteredness of data. 

It is a standard practice to prepare a scatter diagram as shown in Fig. 
10.2 and try to determine the functional relationship needed to fit the 
points. The line should best fit the plotted points. This means that the 
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average error introduced by the assumed line should be minimum. The 
parameters a and b of the various equations shown in Fig. 10.1 should 
be evaluated such that the equations best represent the data. 


x= 
Fig. 10.1 Various relationships between x and y 


We shall discuss in this chapter a technique known as least-squares 
regression to fit the data under the following sitnations: 

1. Relationship is linear 

2. Relationship is transcendental 

3. Relationship is polynomial 

4. Relationship involves two or more independent variables 


№021 FITTING LINEAR EQUATIONS 


Fitting a straight line is the simplest approach of regression analysis. 
Let us consider the mathematical equation for a straight line 

y-a-*bxzf(x) 
to describe the data. We know that a is the intercept of the line and b its 
slope. Consider a point (x; y) as shown in Fig. 10.2. The vertical distance 
of this point from the line fix) = а + bx is the error q;. Then, 

qay Го) 

2y;-a-bx, (10.1) 
"There are various approaches that could be tried for fitting a “best” line 
through the data. They include: 
1. Minimise the sum of errors, i-e., minimise 


Xa, = Ly, -a-bx,) (10.2) 
2, Minimise the sum of absolute values of errors 
іа = Ely: - a bx. 0 (10.3) 


3. Minimise the sum of squares of errors 


Ya? =D, -a-bx,)* (10.4) 
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Line fx) = a + bx- 


<- Error, q; 
_-— Point(x;, y) 


x—- E] 


Fig. 10.2 Scatter diagram 


It can be easily verified that the first two strategies do not yield a 
unique line for a given set of data. The third strategy overcomes this 
problem and guarantees a unique line. The technique of minimising the 
sum of squares of errors is known as least squares regression. In this 
section we consider the least-squarcs fit of a straight line. 


Least Squares Regression 
Let the sum of squares of individual errors be expressed as 


Q= Ža? - $i», - f) 


=3(y, а-а)? 105) 
“ 
In the method of least squares, we choose a and b such that Q is mini- 
mum. Since @ depends on a and b, a necessary condition for Q to be 
minimum is 


99 AQ _ 

Du 0 and 379 
Then 

90.2 $y, -a-be,)=0 

да  & 

a А 

56 5-2 Ër; -а-Мң)=0 (10.6) 
Thus 

Lyi =па+Ьух, 


XxyicaXx o +b Lx? (10.7) 
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These are called normal equations. Solving for a and b, we get 


y nEun -XnE» 


nXsi-(ExY 


(10.8) 


where X and y are the averages of x values and y values, respectively. 


OO 1] 0 _ 


x L u-- mc 4 T 
Lx cl os [L w p а $ | 


The various summations are given as follows: 


Using Eq. (10.8), 


| 5x90-15x26 _ 


b- 2120 


5x55- 15° 
26 15 
ase 120х-5=160 


Therefore, the linear equation is 
у= 16 +12х 
The regression line along with the data is shown їп Fig. 10.3. 


|: 


e 


у= 16 +12х 


7 294 5 6 x—- 
Fig. 10.3 Plot of the data and regression line of example 10.1 
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It is relatively simple to implement the linear regression on a computer. 
The coefficients a and b can be evaluated using Algorithm 10.1 


. Read data values 
. Compute sum of powers and products 


Ex, Ly), Ex?, Exyy, 


. Check whether the denominator of the equation for b is zero. 


. Compute b and а: 
. Print out the equation. 


Interpolate data, if required. 


Program LINREG 


Program LINREG implements Algorithm 10.1. The program reads a 
table of data points and decides a straight line equation to fit the data 


* * 
* Main program P 
® This program fits a line Y = A + BX toa given * 
* вес of data points by the method of least squares d 
* T" d 
* * 
* * 
* * 
* Subroutines used » 
* NIL ў * 


* 


* Variables used 

* X, Y - Data arrays ` 

s N - Number of data sets 

*  8UMX - Sum of x values 

*  SUMY - Sum of y values 

* — SUMXX - Sum of squares of x values 
* 

* 

* 


SUMKY - Sum of products of x and y 
XMEAN - Mean of x values 


YMEAN - Mean of y values 
A - y intercept of the line 


жа-ж » €» »* o» o 
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* Constants used 
* MAX - Limit for number of data points 


INTEGER MAX,N 

REAL X,Y,SUMX, SUMY , SUMKX, SUMXY , XMEAN, YMEAN , DENOM, A, B 
INTRINSIC ABS 

PARAMETER( MAX = 10 ) 

DIMENSION X(MAX),Y(MAX) 


WRITE(*, *) 
WRITE(*,*) "LINEAR REGRESSION’ 
WRITE(*,*) 

* Reading data values 


WRITE(*,*) ‘Input number of data points N' 
READ(*,*) N 
WRITE(*,*) ‘Input X and Y values,', 
D ‘one set on each line’ 
DO 10 T= 1, N 
READ(*,*) X(T), Y(I) 
10 CONTINUE 


* Computing constants A and B 


SUMX - 0.0 
SUMY - 0.0 
SUMXY = 0.0 
SUMXX - 0.0 
po 20 1= 1, М 


SUMX = SUMX + X(I) 

SUMY = SUMY * Y(I) 

SUMXX = SUMXX + X(T) * X(T) 

SUMXY = SUMXY + X(I) * Ү(І) 
20 CONTINUE 


XMEAN SUMX/N 
YMEAN SUMY/N 
DENOM = М * SUMXX - SUMX * SUMX 
IF (ABS (DENOM) .GT. 0.00001) THEN 
B = (М * SUMXY - SUMX * SUMY)/DENOM 
A = YMEAN - B * XMEAN 
ELSE 
WRITE(*,*) 
WRITE(*,*) "NO SOLUTION’ 
STOP 
ENDIF 


* Printing results 
WRITE(*,*) 
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WRITE(*,*) ‘LINEAR REGRESSION LINE Y = А + BX’ 


WRITE(*,*) 

WRITE(*,*) 'THE COEFFICIENTS ARE:' 
WRITE(*,*) " A=^ A 
WRITE(*,*) * B=*, B 
WRITE(*,*) 


End of main LINREG-----------..... + 


Test Run Resulls Shown below is the interactive data input and the 
linear regression line parameters computed by the program LINREG. 


LINEAR REGRESSION 
Input number of data points N 
E: 
5 
Input X and Ү values, one set on each line 
1 3 
25 
gw 
49 
5 11 
LINEAR REGRESSION LINE Y = A + BX 


THE COEFFICIENTS ARE: 
А = 1.0000000 
В = 2.0000000 
Stop - Program terminated. 


re re A 
[385] FITTING TRANSCENDENTAL EQUATIONS 


The relationship between the dependent and independent variables is 
not always linear. Look at Fig. 10.4. The nonlinear relationship between 


ia, 


Fig.10.4 Data would fit a nonlinear curve better than a linear опе 
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them may exist in the form of transcendental equations (or higher order 
polynomials). For example, the familiar equation for population growth 
is given by 

Р = руе“ (10.9) 
where ру is the initial population, Ё is the rate of growth and t is time, 
Another example of nonlinear model is the gas low relating to the pres- 
sure and volume, as given by 

p-ai (10.10) 
Let us consider Eq. (10.10) first. If we observe values of p for various 
values of v, we can then determine the parameters а and b. Using the 
method of least squares, the sum of the squares of all errors can be 
written ав 


Q- $i; - au? 
izl 
To minimise Q, we have 
2Q 
Sa" 0 and 
We can prove that 
Xp? sagut) 


Уро? Inv; =a Dw!) Inv, 
These equations can be solved for a and b. But since b appears under the 
summation sign, an iterative technique must be employed to solve fora 
and b. 
However, this problem can be solved by using the algorithm given in 
the previous section in the following way: let us rewrite the equation 
using the conventional variables x and y as 


Ж узах 
If we take logarithm on both the sides, we get 
Iny=Ina+blnx (10.11) 


This equation is similar in form to the linear equation and, therefore, 
using the same procedure we can cvaluate the parameters a and b. 


jy "Хах: п), Linx Уу 
пу x.) -(Elnx;)* 


(10.12) 


masR-l(ymy —bYIn x; ) 


а = (10.13) 
Similarly, we can linearise the exponential model shown in Eq. (10.9) 
by taking logarithm on both the sides. This would yield 


InP = 1 Po + Ine 
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Since, Ine=1, | 
we have In P= аР, +kt (10.14) 
This is similar to the linear equation 


ysatbx 
where y = In P, a = In Po, b = k, and x = t. We can now easily determine 
a and b and then P, and k. 
There is a third form of nonlinear model known as saturation-growth- 
rate equation, as shown below: 
kıt 
т" hy +t 
This can be linearised by taking inversion of the terms. That. is 
a (2) (10.16) 
p 


This is again similar to the linear equation 
y-a*bx 


(10.15) 


where 


Once we obtain a and b, they could be transformed back into the original 
form for the purpose of analysis. 


Кыш 1 


Given the data table 


x 1 2 s | af s 
y 05 2 45 8 125 


fit а power-function model of the form 


Various quantities required in equation (10.12) are tabulated below: 
a|n | nx In y, (nz | (in x,) (In yJ 
05 | 0 - 0.8931 o | o 


i 

2 | 2 | 0.6931 | 0.6931 | 0.4805 | 0.4804 
3 | 45 | 10986 | 15041 | 1.2069 | 16524 
4 
5 


8 | 1.3863 2.0794 1.9218 2.8827 
125 | 1.6094 2.5257 2.5903 4.0649 
Sum 4.7874 6.1092 6.1995 9.0804 
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Using Eq. (10.12), 


z (5)(9.0804) — (4. 7874)(6. 1092) 
/ (86.1995) — (4.7874)? 


Zz 45.402 — 29,2472 
. 80.9975 - 22.9192 


-19998 . 
2-6 
ша 93092 Bonum T 
P" ү“ 
=-0.6929 z a 
а = 0.5001 EJ = 


Thus, we obtain the power-function equation as 
y = 0.5001 xime 
Note that the data have been derived from the equation 


The temperature of a metal strip was measured at various time inter- 
vals during heating and the values are given in the table below: 


ümermi | 1 | 2 | 3 [| 4 
temp TCO | 70 | 83 100 124 


If the relationship between the temperature 7 and time t is of the 
form. 


T = bel +a 
estimate the temperature at 
We can write the temperature equation in the form 

у= Бх) +а 


This is similar to the linear equation except that the variable x is re- 
placed by the function f(x). Therefore, we can solve for the parameters a 
and 6 using Eq. (10.8) by replacing 


x by fix) 
Xx by Lhe) $ 
Ex?) by Xf 
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| p MEM ey.) -Ife Zy 
n FI)? [ао] 


а СВУ) 


п 


We can vet ир the following table to obtain the various terms. Note that 
f(x) =e, 


x y f(x) yf) fa? 

1 70 1.98 89.89 1.65 

2 83 165 13684 272 

3 100 212 211.70 448 
|4__14___272 33107 1-39 

Sum 377 771  Ti55 1624 


Now, 


(4715.5) - (7.77) (877) 
7 4006.24 - (7.77)? 


= 37.62 


a JI - 7.62) (1.77) 
4 
-2116 
The equation is 
Т = 37.62 е°®% + 21.16 
The temperature, when t = 6, is 
T = 37.62 59 + 21.16 


= 189.76°C 


[3641 FITTING A POLYNOMIAL FUNCTION 


When a given set of data does not appear to satisfy a linear equation, we 
can try a suitable polynomial as a regression curve to fit the data. The 
least squares technique can be readily used to fit the data to a polynomi- 
al. 
Consider a polynomial of degree m — 1 
у=ау+аух+аух® ea, 3771 (10.17) 


=f) 
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If the data contains n sets of x and y values, then the sum of squares of 
the errors is given by 


Q= Xly, - f)? (10.18) 
=1 
Since f(x) is a polynomial and contains coefficients «;, пз, аз, etc., we 


have to estimate all the m coefficients. As before, we have the following 
m equations that can be solved for these coefficients. 


0 


А Әр) 
2$ а = 
f) yu 


да, 7i 


Thus, we have 


Xl, - fils?! =0 j=1,2, m 
ist 


Ы Е 
Substituting for f(x) 
xn 03x, +азх? *osayxp d) Е Ens * 
i [= 
These are m equations ( j = 1, 2...т) and each summation is for i = 1 to n. 
a,n*as Xx, tas Ex? +... +а„ Exp = Ун 
o,Ex;*a; Ул? +аз Ух+.. tom Ух" =Lyix; (1019) 


а Уж" 4a, Ухт жаз Ух". tan Dee Ууу) 
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The set of m equations can be represented in matrix notation as follows: 


CA=B 
where 
n Ee ар xaT 
Ух Xa X4 Уху 
Са m 
Xs фар Eu 
[ау Exi 
аз Lyx 
А =| a B=| Уух; 
ам [Хэт 


The element of matrix С is 
CU b-Xz/ 57 уела „т and k=1,2,..,m 
я 
Similarly, 
B= 5y]  ў=1,2 т 
“ 
Fit a second order — to the data in the table below: 


x 4.0 
E "s us es 27.0 


The order of polynomial 
equations as shown below: 


2 and therefore we will have 3 simultaneous 


вап+а; Ух; +аз Xx? = Xy, 


а Ух, +a: Ух? +a3 уух -Yyix, 


а Ух? +a: Уи? +a3 Ezt = Уух? 
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The sums of powers and products can be evaluated in a tabular form as 
shown below: 


x y Ү T ё | # ух | эх? 
1 6 1]| 1 1] 6 6 
2 11 4 8 16 22 44 
3 18 9 27 81 54 | 162 
а | 27 | 16 64 | 256 | 108 | 432 
x| 10 62 30 100 | 354 190 | 644 | 


Substituting these values, we get 
дау + 10a, +300, = 62 
10a, + 30a, + 100a, = 190 
30a; + 1000; + 354a, = 644 
Solving these equations gives 
а= 3 


аз=2 
ay=1 
‘Therefore, the least squares quadratic polynomial is 
y=3+2x+x° (verify using table data) 


Algorithm for Polynomial Fit 


The set of т equations given by Eq. (10.19) can be solved by using an 
elimination method discussed in Chapter 7. Algorithm 10.2 lists the 
steps involved in computing the coefficients of the regression polynomial. 


Polynomial Regression 


Read number of data points n and order of polynomial mp 
Read data values 
it n< mp, 
print out ‘regression is not possible’ and stop; 
else 
continue 


. Set m=mp+1 
. Compute coefficients of C matrix 
. Compute coefficients of B matrix 
. Solve for the coefficients a), a, ... а 
Write the coefficients 
. Estimate the function value at the given value of independent var- 


| Algorithm 10.2 
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Program POLREG 


This program fits a polynomial curve to a given set of data points by the 
method of least squares. POLREG uses a subprogram NORMAL to com- 
pute the coefficients of normal equations and another subprogram GAUSS 
to solve the normal! equations obtained. Finally, the program prints the 
polynomial coefficients, a(1) to alm). 


PROGRAM POLREG 


* Main program Ы 
* This program fils a polynomial curve to a given * 
а set of data points by the method of least squares a 
* 
^ 
* 


* Functions invoked 


* NIL 

W стекан соса. Й 
* Subroutines used > 
ш IORMAL, GAUSS * 


* Variables used s 
x X,Y - Arrays of data values » 
* N - Number of data points * 
* MP - Order of the polynomial under construction  * 


* M - Number of polynomial coefficients Ы 
^ C - Coefficient matrix of normal equations * 
* B - Right side vector of norma! equations ы 
~ $ Array of coefficients of the polynomial © 
ts * 
* Constants used 9 
* МАХ - Maximum number of data points м 


REAL Х,Ү,С,А,В 
INTEGER М,МР,М,МАХ 
РАВАМЕТЕВ (МАХ = 10) 
DIMENSION Х (МАХ), Ү(МАХ),С(МАХ, МАХ), A (МАХ), B(MAX) 
WRITE(*, +) 
WRITE(*, *) 'POLYNOMIAL REGRESSION’ 
WRITE(*,*) 
* Reading data values 


WRITE(*,*) ‘Input number of data points{N)’ 
READ(*,*) N 
WRITE(*,*) ‘Input order of polynomial(MP) required’ 
READ(*,*) MP 
WRITE(*,*) "Input data values X and Y,', 

+ ‘one set on each line’ 
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DO 101 - 1, N 
READ(*,*) X(I), Y(1) 
10 CONTINUE 


* Testing the order 


IF(N.LE.MP) THEN 
WRITE(*,*) 'REGRESSION IS NOT POSSIBLE' 
GO TO 20 

ENDIF 


* Number of polynomial coefficients 
M = MP+l 

* Computation of elements of С and B 
CALL NORMAL (X,Y, C, B,N, M, MAX) 

* Computation of coefficients a(l) to alm) 
CALL GAUSS (M,C, B,A) 

* Output of coefficients all) to a(m) 
WRITE(*,*) 
WRITF(*,*) ‘POLYNOMIAI, COEFFICIENTS" 
WRITE(*,*) 
WRITE(*,*) (A(1), 1-1,М) 
WRITE(^,*) : 


20 STOP 
END 


- End of main program POLREG - 


SUBROUTINE NORMAL (X, Y,C,B,N,M, MAX) 

IR MS ete nar 

* Subroutine 

* This subroutine computes the coefficients 
* of normal equations 


* Arguments 

Input 
N - Number cf Gata poirts 
X,Y - Arrays of data values 
M - Number of coefficients of the polynomial 
MAX - Maximum size of arrays 

Output 
С - Coefficient matrix of normal equations 
B - Right side vector of normal equations 


Local Variables 
NIL 


eee ER EE EP 9 


* 


10 
20 
30 


40 
50 


Functions invoked 
NIL 


Subroutii 
NIL 


S cal 


REAL Х,Ү,С,В 
INTEGER  N,M,MAX 


DIMENSION Х (МАХ), Ү (МАХ), С (МАХ, МАХ), В (МАХ) 


DO 30 J=1,M 
DO 20 K=1,M 
C(I,K) = 0.0 
Ll = KiJd-2 
DO 10 I-1,N 


C(J,K) = Cld,K) + X(T) ** L1 


CONTINUE 
CONTINUE 
CONT INDE 


DO 50 J- 1,M 
B(J) - 0.0 
L2 = J-1 
DO 40 І = 1,N 
B(J) = BIJ) + ¥(I) * X(I) 
CONTINUE 
CONTINUE 


RETURN i 


SUBROUTINE GAUSS(N,A, B, X) 


* L2 


n linear 


Subroutine 
This subrcutine solves a set of 
equations by Gauss elimination method 
Arguments 
Input 
М - Number of equations 
A - Matrix of coefficfents 
B - Right side vector 
Output 


X - Solution vector 


Local Variables 
PIVOT, FACTOR, SUM 
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* Functions invoked 
^ NIL 


* NIL Ж 


REAL A,B,X, РІУОТ, FACTOR, SUM 
DIMENSION A(10,10), B(10), Х(10) 
*oo-------------- Elimination begins ---------------- * 


DO 33 K - 1, N-1 
PIVOT - A(K,K) 
DO 22 I = K+l, N 
FACTOR = A(I,K)/PIVOT 
DO 11 J = Kel, N 
A(I,J) = A(I,J) - FACTOR * A(K,J) 
11 CONTINUE 
B(1) = B(I) - FACTOR * B(K) 
22 CONTINUE 
33 CONTINUE 
*о------------- Back substitution begins ------------- 


X(N) = B(N)/A(N,N) 
DO 55 K = N-1,1,-1 


sum = 0 
DO 44 J = K+1,N 
SUM = SUM + A(K,J) * x(a) 
44 CONTINUE 
X(K) = (B(X) - SUM) /А(К, К) 
55 CONTINUE 
RETURN 
END 
to End of subroutine GAUSS ------------- * 


Тез! Run Results The program was used to fit a polynomial curve to the 
following data points: 


х, 10 21 32 [| 49] 


» 20 | 25 30 10 


The results are given below: 


POLYNOMIAL REGRESSION 
Input number of data points (№) 
4 
Input order of polynomial(MP) required 
2 
Input data values X and Y, one set on each line 
1.0 2.0 
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POLYNOMIAL COEFFICIENTS 
2.0740160 -2.053067E-001 1.6804415-001 


Stop - Program terminated. 


eral MULTIPLE LINEAR REGRESSION 


There are a number of situations where the dependent variable is a 
function of two or more variables, For example, the salary of a salesper- 
son may be expressed as 
у= 500 + Sx, + 8x3 
where x, and x, are the number of units sold of products 1 and 2, 
respectively. We shall discuss here an approach to fit the experimental 
data where the variable under consideration is a linear function of two 
independent variables, 
Let us consider a two-variable linear function as follows: 
у=ау+аух +ау (10.20) 
The sum of the squares of errors is given by 


А 
Q7Xi -a, -азх, -азг,)? 
= 


Differentiating with respect to a}, a; and ay, we get, 


dQ 

Ja, 2X0 ira cay c ag р 
a 

29 


Ja." -25 (y, -a - az, - asz) y, 
Setting these partial derivatives equal to zero results in 
na; *(Yx.)a *(Ez,)as = Уу, 
(Lx Jar +(Ex? Jas «(Exizi)u = Уу 
(Zz; )a, *(Exizi)as +(Zz? Jas = уг; 


These аге three simultaneous equations with three unknowns and, 
therefore, can be expressed in matrix form as 


п Уух Уг: Га Xx 
Ух Xi Enz Is}. Xxx (10.21) 
Ea Уха Уг? e (Eviz 
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This equation can be solved using any standard method. This is a 
two-dimensional case and, therefore, we obtain a regression “plane” rather 
than“line”. 

We can easily extend Eq. (10.21) to the more general case 


Y = Gy + yxy +азху +... + met Xm 


Given the table of data 

x 1 | $3 3 И 4 

z o | 1 2 3 
y 2 | | и | a0 


Obtain a regression plane to fit the data. 


The various sums of powers and products required for evaluation of 
coefficients are tabulated below: 


[al aly |2] 2| al r| r 

ry | 12 1 0 0 үө 
2 1 18 | 4 1 2 | 36 | 18 
3 | 2 24 | 9 4 6 | 72 | 48 
4! 3 30 | 16 9 | 12 | 120 | 90 

xp 10] 6 84 | 30 14 | 20 | 240 | 156 | 

| ATUM —À 

On substitution of these values in Eq. (10.21) we get P sid 


da, + 10а, + баз = 84 
10a, + 30а, + 200; = 240 
ба| + 2009 + lag = 156 
Solution of these equations results in 
a,=10 
а= 2 
а= 4 
Thus, the regression plane is 
у= 10 + 2х + 42 


[ao ILL-CONDITIONING IN LEAST SQUARES 


METHODS 


The problem of ill-conditioning can arise in implementing the least 
squares regression methods. As a consequence, the computed solution 
night differ substantially from its exact solution. This problem becomes 
nore severe when the degree of approximating polynomial is large. 
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Ill-conditioning arises basically due to very large differences in the 
coefficients of the normal equations. Recall that the coefficients are sums 
of powers and products of data values, Techniques such as pivoting and 
iterative refinement can be incorporated to overcome the problem of ill- 
conditioning. The problem of ill-conditioning can also be tackled by in- 
creasing the precision of arithmetic operations. 

Another way of overcoming the least-squares ill-conditioning problem 
is to use orthogonal polynomials. This would enable us to obtain the 
coefficients a, in closed form, thus avoiding numerical solution of simul- 
taneous equations. Further discussions on this approach is beyond the 
scope of this book. 


367] SUMMARY 


We often use experimental data for establishing a relationship between 
two variables. This relationship may be used for testing some existing 
mathematical models or establishing new ones or even estimating the 
values of dependent variables at some point. In this chapter, we have 
used a technique known as least squares regression to establish the 
following types of relationship between the variables of a table of exper- 
imental data. 

* Linear relationship 

* Transcendental relationship 

* Polynomial relationship + 

* Multivariable relationship 

Also presented are FORTRAN Programs and test results for obtaining 
linear and polynomial equations for experimental data. 


erem] 


Curve fitting Regression analysis 
Dependent variable Regression line 
Independent variable Regression plane 
Least squares regression Saturation growth rate 
Normal equations Scatter diagram 


- What is regression analysis? 

- What is a scatter diagram? 

. What is the principle of least. Squares regression? 

- Show that the linear regression line of y on x passes through the 
point that represents the mean of x and y values. 


Soma 


344 Numerical Methods 


fit data to 
(a) power-function model of the form 
у= 
(b) population growth model of the form 
у=ае“" 
using the principle of least squares. 
6. Draw a flow chart to illustrate the steps involved in developing a 
program for multiple regression. 


P Derive normal equations for evaluating the parameters а and b to 


1. Use least squares regression to fit a straight line to the data. 


= ата ати гарт» и 
У 1 2 4| 4 Б + 7 8 
Along with the slope and intercept, also compute the standard er- 
ror of the estimate. 
F^ In an organisation, systematic efforts were introduced to reduce 


Fa the employee absenteeism and results for the first 10 months are 
shown below: 


мепа | 3 [2] aTa eLeëlzlsip] 10 


Absentees | " | " 
ipe du] ю |» | е |85 9| в | а5| 7] 8 |7: 


Fit а linear least squares line to the data and from this equation 
estimate the average weekly reduction in absenteeism. 
3. The following table shows heights (i) and weights (w) of 8 persons. 


Z^ item) [175 | 165 | 160 | 180 [ 150 | 170 | 158] 185 
w(kg) | 68 


в | 59 | 7 | 51 | 62] 5а | 68 


Assuming a linear relationship between the height and weight, find 
the regression line and estimate the weights of the persons with 
the following heights. 
(a) 140 cm (b) 163 cm (с) 172.5 
4. Fit a geometric curve 
у= 
ta the following data: 


xIzse[-i[6Ta]12] 8T 4. 
У aj 6 o | -5 [-41[ 130] 300 
5. Given the table of points 


lola ats [38 2 | 16 | 20 
у o | 12 | is | 2 | 20 [ ac] 2: | 20 


ох 


Th 
12. 


14. 
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use least squares regression to fit 
(a) straight line, and 
(b) parabola 
to the data. Compute and compare the errors. 


. Fit the saturation growth rate model 


yao 
b+x 
to the data given below. 


у | 14 | 20 | 24 | 26 


. Fit the power equation 


year 


to the data given in Exercise 6. 


. Fit a quadratic polynomial to the data given in Exercise 6. 
. Use the power equation to the data 


ЕІ _[ 10 [125] ь Tavs | 20 
Lx | 34 | 16 | 12 | us | 05 | 96. 


edens = 
Use the exponential model 
be 


у=ае 
data 


- а 

x | oa | оз [|12 [16] | 24 
y 75 100 140 | 200 270 375 

Fit a parabola to the data given in Exercise 10. 


Find the least squares line у = ax + б that fits the following data, 
assuming that there are no errors in x values. 


= 
х 1 a Та Та E i 6 
y | 405 | тїз | 965 [1220 | 1520 |1900) 


g 
т 
= 
8 


. In Exercise 12, treat x as dependent variable on у and find the least 


squares line x = ay + b, assuming that there are no errors in y but x 
values contain errors. Observe that this is not the same line ob- 
tained in Exercise 12. 

Use multiple linear regression to fit 


Le Tx Т s fal 
E 4] з [271 9 | 
» 18 ]-36 38 3 16 


Compute coefficients and the error of estimate. 
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15. 


= 


‚ Modify the program LINREG to calculate the sum of squares of the 


Given the data points 


"EN: "|3 |2 
=» | 3 з 4 
y а! 15 -8 m 3l 26 


obtain a regression. plane to fit the data. 


| 


wojo + 


errors for the linear fit and print the error output. 


. A set of data, when plotted resembles ап exponential curve 


у= а) 
Write a program to evaluate the parameters а and b of this regres- 
sion curve using the principle of least squares. 


. In fitting a polynomial, its degree should be chosen such that the 


error is minimum. Given à set of data, it would be difficult, to 
decide the degree that would represent the data best, A good rule 
of thumb is to begin with the first degree and continue fitting 
higher order polynomials until 


Su. 


n-t 


or until a polynomial of degree n-1 is obtained, Q, is the sum of 
squares of errors of polynomial of degree 1. 
(a) Prepare a flow chart to fit a polynomial that satisfies this 
condition. 
(b) Modify the program POLREG to incorporate these changes. 


‚ Develop a user-friendly program for multiple regression. 
. Develop a user-friendly, menu-driven program that allows us an 


option to select. and use one of the following models to fit a given 
set of data. 

(a) Straight line model 

(b) Exponential model 

(c) Power equation 

(d) Saturation-growth rate model 


Numerical 
Differentiation 
—M — 


ax NEED AND SCOPE 


Need for differentiation of a function arises quite often in engineering 
and scientific problems. If the function has a closed form representation 
in terms of standard calculus, then its derivatives can be found exactly. 
However, in many situations, we may not know the exact function, What 
we know is only the values of the function at a discrete set of points. For 
instance, we are given the distance travelled by a moving object at some 
regular time intervals and asked to determine its velocity at a particular 
lime. Tn some other instances, the function is known bat it is so compli- 
cated that an analytic differentiation is difficult (if not impossible). In 
both these situations, we seek the help of numerical techniques to obtain 
the estimates of function derivatives. The method of obtaining the de- 
rivative of a function using a numerical technique is known в numeri- 
«eat differentiation. There are essentially two situations where numerical 
differentiation is required. They аге: 
(A The function values are known but the function is unknown. Such 


functions are called tabulated function. 
2. The function to be differentiated is complicated and, therefore, it is 
/¢ difficult to differentiate, 
dn this chapter, we discuss various numerica! differentiation methods 
that could be applied to both tabulated and continuous functions. 
Remember that while analytical methods give exact answers, the nu- 
merical techniques provide only approximations to derivatives. Numeri- 
cal differentiation methods are very sensitive to roundoff errors, in 
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addition to the truncation error introduced by the methods themselves. 
‘Therefore, we also discuss the errors and ways to minimise them. 


Jm DIFFERENTIATING CONTINUOUS FUNCTIONS 


We discuss here the numerical process of approximating the derivative 
f'(x) of a function f(x), when the function itself is available. 


Forward Difference Quotient 


Consider a small increment Ax = h in x, According to Taylor's theorem, 
we have 


2 
fice h)= fos hf B pou ал) 
for x < 0<х + h. By rearranging the terms, we get 
poga TEMA Eo) (11.2) 
А 2 
Thus, if А is chosen to be sufficiently small, f'(x) can be approximated by 


[ re Fc +һ)-[\х) (11.3) 
Aa [Л 


with a truncation error of 


Lo 214) 
af w | 


i. £t 
Equation (11.3) is called the first order forward difference quotient. This 
is also known as two-point formula. The truncation error is in the order 
of апа can be decreased by decreasing h. 

Similarly, we can show that the first-order backward difference quo- 


tient is 


fix) — fi h) 1.6) 
a 


Estimate approximate derivative of f(x) = a^ at x = 1, for h = 0.2, 0.1, 
0.05 and 0.01 using the first-order forw: а difference formula. 


fi 829 = s 
Therefore, 


pa Fe LO 
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Derivative approximations are tabulated below: 


h | fü | Brrr 
0.2 22 02. 
01 21 | OL 
0.05 2.05 | 005 
0.01 2.01 Jl 0.01 


Note that the correct answer is 2. The derivative approximation ap- 
Proaches the exact value as Л decreases. The truncation error decreases 
proportionally with decrease in A. There is no roundoff error. 


„епт! Ditterence Quotient NV® — 

Note that Eq. (11.3) was obtained using the linear approximation to /(x), 
This would give large truncation errors if the functions were of higher 
order. In such cases, we can reduce truncation errors for a given h by 
using a quadratic approximation, rather than a linear one. This can be 
achieved by taking another term in Taylor's expansion, i.e., 

f(x --h)s fle) + hf ix) - a Fx) + A fmo) (11.6) 
Similarly, 
2 з 
f(x - hy» fix) Mf) er pra - fte. (пл) 
Subtracting Ед. (11.7) from Ед. (11.6), we obtain 


Гое Го = пуча) Аре) feo.) 118) 


Thus, we have 


(11.9) 


2 2 
аа 


which is of order А2, Equation (11.9) is called the second-order central 
difference quotient. Note that this is the average of the forward differ- 
ence quotient and the backward difference quotient. This is also known 
as three-point formula. The distinction between the two-point and three- 
point formulae is illustrated in Fig. 11.1(a) and Fig. 11.1(b). Note that 
the approximation is better in the case of three-point formula. 
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Fig. 11.1 llus 


pa | 


à 


{+—— Approximation 


Qt ie slope 


x-h x x+h 
(а) 
А 
СТ Approximation 
True slope 
a E 
x-h x x+h 
[2] 
tration ot (a) Two-point formula and (b) Three-point formula 


Repeat the exercise given in Example 11.1 for the thr: 


Therefore, 


rmula. 
От 


Гожа Го 


f(x) T 
os Faso- fü- M) 
ea 


‘The derivative approximations are tabulated below: 


The derivativ 
quadratic appi 


h fa 
02 | 20 


0.1 2.0 


is exact for all values of л. This is because we have used 
roximation for a quadratic function. We can also derive 


further higher-order derivatives by using more points in the formula. 
For example, the five-point central difference formula is given by 
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fix+2h) = Врх +h) 8f + к-у 
19h | 


"Гһіѕ is a fourth-order approximation and the truncation error is of 
order А". In this case, the truncation error will approach zero much 
faster compared to the three-point approximation. The derivation of 
Eq. (11.10) is left to the reader as an exercise. (Hint; use step size 2h 
instead of h in Eq. (11.8) and use up to fifth derivative of Taylor's expan- 
sion). 


| fice) 


Error Analysis 
As mentioned earlier, numerical differentiation is very sensitive to round- 
off errors. If E,(A) is the roundoff error introduced in an approximate 
derivative, then the total error is given by 
Eth) = Eth) + EAR) 

Let us consider the two-point formula for the purpose of analysis. That is, 
Fix «h- fl) _fi-fo 

h h 


If we assume the roundoff errors in ў; and f; as е, and eg, respectively, 
then 


fx) = 


рш) Ваен) 


һ-% е-е 

hh 

If the errors e, and e, are of the magnitude e and of opposite sign (i.e | 
the worst case) then we get the bound for roundoff error as 


|E, (ha х ŽE 
We know that the truncation error for two-point formula is 
= Арн 
| E, (А = af (8) 
or 


Mah 


E,UD| < 
16,00 < = 


where M; is the bound given by 
М; = max |/"(@| 
Aas Osxth 


À x 
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Thus, the bound for total error in the derivative is 


Mh | 2e 

2 А 
Note that when the step size A is increased, the truncation error in- 
creases while the roundoff error decreases. This is illustrated in Fig. 
11.2. For small values of h, roundoff error has an overriding influence on 
the total error. Therefore, while reducing the step size, we should exer- 
cise proper judgement in choosing the size. This argument applies to all 
the formulae discussed here. 


аза) 


ECA) < 


Total error- 
Error Truncation error 
РА Round off error 
Pot == 


Fig. 11.2 Error in derivatives as a function of ^ 


We can obtain a rough estimate of h that gives the minimum error. By 
differentiating Eq. (11.11) with respect to A, we obtain 


ру Ma... 2e 
E'(h)- 2 


We know that E(A) is minimum when E'(A) = 0. That is, 


M 2. 
2 1 
Solving for л, we obtain 
e 
ЫЫ үт (11.12) 


Substituting this in Eq. (11.11), we get 
Elhop )- 2JeM3.- (11.13) 
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Ey HEN 


Compute the approximate derivatives of f (x) = sin x, atx = 45 radians, 
at increasing values of A from d to 0.04, with a ер size of 0.005. 


fix) = sin x 

Using two-point formula 
a) аА) fel ae 

f'a) = ® 

Given 
x = 0.45 radians 

So, f (x) = sin (0.45) = 0.4350 (rounded to four digits). Exact f'(x) = cos x 
= 00400.45) = 0.9004. ~ 


Table helow gives the approximate derivatives of sin x at x = 0.45 " 
using various values of A. - 


h farh fw Error | 
0.010 | 0.4439 | 08900 | 0.0104 
0.015 0.4484 0.8933 0.0071 
0.020 | 04529 | 0.8950 | 0.0054 | 
0.025 | 04573 | 0.8935 | 0.0069 | 
| 


0.030 0.4618 | 0.8933 0.0071 
0.035 0.4662 0.8914 0.0090 
0.040 0.4706 | 0.8900 | 0.0104 


The table shows that the total error decreases from 0.0104 (at Л = 0.01) 
till A = 0.02 and again increases when A is increased as illustrated in 
Fig. 11.2. 

Since we have used four significant digits, the bound for roundoff 
error e is 0.5 х 10 *. For the two-point formula, the bound M, is given by 


M = max |f") 
0.40 < 0< 0.49, 
= |sin (0.49)| = 0.4706 


‘Therefore, the optimum step size is 


[0.5 x10 
ЗЕ ү за Ге = 
Pope үм, * e 0.4706 
- 0.0206 


This agrees very closely with our results. 
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Higher-order Derivatives 

We can also obtain approximations to higher-order derivatives using 
Taylor's expansion. To illustrate this, we derive here the formula for 
Р"). We know that 


nx) В, 


Рос Ро) MfG BE pets + у 


and 


Flax - h) - fi) - hf x) + БГ") Ee Ry 


Adding these two expansions gives 

fix +h) + fœ- h) = fla + рО) +R, + Ry 
Therefore 
flx+h)-2f(x)+ fix - h). (Ry + В) 


Fix) - hz Du 
Thus, the approximation to second derivative is 
+h) 2f h 
Е "(x ANN mmm | (11.14) 
— — 
The truncation error is 
В, + Р, 
гун 
А 
а= aree 


The error is of order A*. 
Similarly, we can obtain other higher-order derivatives with the er- 
rors of order л? and A*. 


2 
Find approximation to second derivative of cos (x) at x = 0.75 with 
he 0.01. Compare with the true value. ~ 


= fix «&)- 2f) fix - À) 
F а 
0.76) 2/60. 0. 
100.76) 2f(0.75) + f(0.74) өй бй 


f 40.75), =" 0.0001 
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_ 0.7248360 — 2(0.7316888) + 0.7384685 
Е 0.0001 


— 14623046 – 1.463; 
ШШЕ RON 
= -0.7300000 
Exact value of /"(0.75) = — cost0. 
" — 


= - 0.7316888 
Error = – 0.016888 _ 
This error includes roundoff error as well. 


EE DIFFERENTIATING TABULATED FUNCTIONS 


Suppose that we are given а set of data points (x,, f,), i = 0, 1, ... п which 
correspond to the values of an unknown function /(x) and we wish to 
estimate the derivatives at these points, Assume that the points are 
equally spaced with a step size of A. 

When function values are available in tabulated form, we may 
approximate this function by an interpolation polynomial p(x) discussed 
in Chapter 9 and then differentiate p(x), We will use here Newton's 
divided difference interpolation polynomial. 

Let us first consider the linear equation 


p(x) = а, + Oy = х0) + R, 


where R, is the remainder term used for estimation. Upon differentia- 
tion of this formula, we obtain 


dR, 
dx 


Then the approximate derivative of the function f(x) is given by 


piix) =a, 1 


fx) = рух) = a, 
We know that 
a, = flxy xi 


fido fo) 
ә = 


On substituting 


hari= 
PETTY 
Ех N 
we get i 
f fern te | (11.15) 
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This is the familiar two-point forward difference formula. 
Now, let us consider the quadratic approximation. Here, we need to 
use three points. Thus, 
PAX) = ау + аүх- ху) + азіх — xo) (x - xj) + Ry 
Then 
руб) = ay + ах - ху) + (к—ху)} + mE 


Thus, we obtain 
Ро) = a, + as [к — x9) + (x - х)] (11.16) 
Let ху = x, x, = х +A, xy =x + 2h, Then 
foh) - Га) 
e h 
fis xil fissi] 


вз = f lo, xy ху] = PEE 


Га) Гар) fr) fig) 


жу eiui 


X2 -30 
_ f(x + 2h) - 2f(x +h) + fle) 
Е 2h? 
Substituting for а, and ay in Eq. (11.16) and after simplification, we get 
foe) = BE + Af ce hy) а + 2h) | 


E (11.17) 


‘This із a three-point forward difference formula. We can obtain a three- 
point backward difference formula by replacing Л by -h in Ед. (11.17). 
Therefore, the three-point backward difference formula is given by 
(x) = ЗР) - Af А) fix - 2h) 
Г") = UL HE MM 
Similarly, we can obtain the three-point central difference formula by 
letting xo =x, x} =x — hx, =x +h in Eq. (11.16). Thus, 
Га) fix - n) 
ape жш 
f(x +h) - 2f(x) + fix h) 
Se UT ш 


(11.18) 
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Substituting these values in Eq. (11.16) we get 


f(x+h)-f(x-h) 


ftx)= Hh 


(11.19) 


Error Analysis 
Let us first take the linear case. 
Ry = f lxo, xj, x] (x — x9) (х- ху) 
where , 
fixo xy xl = P8 


for some point q in the interval containing xy, xy and x. Then 
dR. Г") 
а == Ик - ху) + (к х 


Letting ху =x and x; =x + h 
E == rao, х<0<х+һ 
‘Therefore, the truncation error is of order A. This conforms with 
Eq. (11.4). Now, let us consider the quadratic approximation. 
В, = Г ху, ху, x] (х— x9) (x - x1) (x — xg) 


(8) 
flo, Ху, xy xl P 


for some point q in the interval containing xo, х1, xa and x. 
Я (д) 
m - De [x — xo) (x — x1) + (x р) (7 ху) + (к- x9) (x ~ xl 
By setting x, =x, x, = x + h and xj =x + 2h, 


2% = pmo, x<Osx+h 


This error equation holds good for both forward and backward three- 
point formulae. 

For central difference formula, we must set xp = x, x; = x – À and 
х;=х+һ. Therefore, 


NE LET ” А 
^ qf. x-hs0ss h 
Note that the error is of order A? 
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The table below gives the values of distance travelled by а car at various 
time intervals during the initial running 


E ЕР ; 
Time, t(s) 5 ] в в | ә | 
Distance 10.0 145 195 | 255 | 320 

travelled, t | | | 
бл) = | 


Estimate velocity at time ¢ = 5,t- 7 and t 


We know that velocity is given by the first derivative of sít). At 1= 5, ме 
use the three-point forward difference formula (11.17). 


390) +4600 + h) - sit + 2h) 


v(t) = 3k 
Then 
3(10)  4(14.5) - 19.5 
н ee 
P 
= 4.25 km/s 


At t = 7, we use the central difference formulae (11 19). Therefore, 


_25.5- 14.5 
NE 
At ¢ = 9, we use the backward-difference formulae (11,18) 


= 5.5 km/s 


_ 3032) ~ 4(25.5) + 19.5 


5 


= 6.75 km/s 


Higher-order Derivatives 
Formulae for approximating the second and higher derivatives can also 


be obtained from the Newton divided difference formula. The second- 
order derivatives are as follows: 


Central 


Еа 
ЕКОЕ WB) 


к? (11.20) 
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' @ї 


=h? gen 
Error = 45 fOO 


Forward 
т") = 2[\х)-5[(х +һ)+ КЕ 2h) - fix + 3h) 120 
LA? pw 
Error = ^45 гө) 
Backward 
fia) = 2f(x) -5fix - h) + Afix - 2h) — fx 3h) (11.22) 


h? 
Erro LAM pw (gy 
12 


pn ae 
Use the table of data given in Example 11.5 to estimate acceleration 
att=78 


Acceleration is given by the second derivative of s(t). Therefore 


5 - ) 
deri +h) Balt) + a(t h 

he 
Therefore. 


25.5 - 2119.5) + 14.5 


ath) - —— —Àz 


The equation for deflection of a beam is given by 


y'-e?-0 200) = OY 20 


Estimate, using a second-order derivative, the approximate deflections 
at x = 0.25, 0.5, and 0.75. Note that у(х) is the deflection at x. 


ук +h) -2yG) уб A) uu 


y"@ y= ro 


h-025 
Then, 
у(х +0.25) - 2560 y — 
0.0625 
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Consequently, 


убх + 0.25) ~ 2ух) + у(х — 0.25) = 0.0625 e** 
Substituting x = 0.25, 0.5 and 0.75 in the above equation in turn, we get 


(0.5) – 2¥(0.25) + (0) = 0.0665 
0.75) ~ 2у(0.5) + у(0.25) = 0.0803 
У(1) ~ 2у(0.75) + 300.5) = 0.1097 
Given y(0) = (1) = 0. Denoting 
уу = 200.25), у= 00.5) апі 
We have 
0 +y,- 2у, = 0.0665 
y3- 2y +y, = 0.0808 
-2y3 + у; + 0 = 0.1097 
Solving these three equations for y,, y, and 
Yı = (0.25) =- 0.1175 
Уз = 0.5) =- 0.1684 
Уз = (0.15) =- 0.1391 


уз = 000.75) 


у» we get 


КЕ DIFFERENCE TABLES 


Tables 11.1 to 11.3 list difference derivatives /'(x) and [''(х) and associ- 
ated errors for forward, backward and central difference formulae, Fol- 


lowing notations are used: 
f, denotes f(x + 2А) 
fg denotes f(x ~ 2А) 


Table 11.1 Forward difference deriv 


[Derivative Formula | Err | 
| fo *fi | h 
E m | 3 
d [77 
ЕТ |. 
Ре) кй fa ‚А? re) 
3 
L m (е) 
Afo + 18/1 -9f242fy NN Bro а 
6h 179 
| ——-— ДИЮ. 1] 
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Table 11.1(Contd) 


Derivative Formula | Error 
; -25fo + 48f, — 36/з + 16/з - 3f. ES uii 
ге I a sus 
(з2е/һ) 
fo - 25 + fa " 
лы > hf 
[xg (Ael?) 
[Г 975 287, +47 - fa 7 ‚ША poo =) 
[E 12 
l (12e/h?) 
#Roundolf error 
Table 11.2 Central difference derivatives 
Derivative Formula Error 


-А® р» 
6 Р (0) 


[oy 
E га fo 
[7:2] 
faA-3fo* һ? 
AL pu 
ге 15 feu 
fi) (аел) 
ee SS Се” =| 
-2f_ + 16/_, - 30fa + 16/; ~ fe BA лау 
"m em sol e 
_ А (162/3ћ2) 
#Roundoff error 
Table 11.3. Backward difference derivatives 
Derivative Formula Error 
-fath B es 
= at (e 
| | (gofbY i 
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Table 11.8/Contd ) 


Derivative is Formula ИШ | Er 
Ёо 471 +30 “ 
—M— ШШ 
2h diio 
fog L 2 ет) 
4 
Be put (gy 
4 
(20e/3h) 
| as e: эр! i айды! = 
З/-4 ~ 16/-з + 36/_, - 48f_, +257 Аа вө, 
12h 5 
(del) 
f-2 -2f-1 + fo " 
MEO үө, 
[ NE. NM 
14? nis, 
H Fu 
Lo = v (120th) __ 
#Roundoff error 


HFS RICHARDSON EXTRAPOLATION 
ee ila 


Richardson extrapolation is based on a model for the error in a numeri- 
cal process. This is used to improve the estimates of numerical solutions. 
Let us assume 

xi + МА" (11.93) 


x, is the kth estimate of solution x^ and M А" is the error term. Let us 
now replace В by rh and obtain another estimate for А 


а= + Мт" (11.24) 
Multiplying Eq. (11.23) by 7^ and solving for x*, we get 
он 
. хуг, | 


М жира ЧУ, (11.25) 


This is known as Richardson extrapolation. estimate. Note that the error 
term has been eliminated. 


This concept can be extended to the estimation of derivatives dis- 
cussed so far. Using this, we can obtain a higher-order formula from a 
lower-order formula, thus improving the accuracy of the estimates. This 
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process is known as extrapolation, Let us consider the three-point 
central difference formula with its error term. 
f*h)-ftx-h) h? 


poas pty 


= Din) - 6 ra) (11.26) 


where D(h) is the estimate obtained using h as step size. Note that f'(x) 
is the exact solution which is usually approximated by D(A). If we re- 
move the error term, then we can obtain a better approximation. Now, 
let us obtain another approximation for f'(x) by replacing ћ by rh. Thus, 


fix s rh) - fix- rh) hêr? 
лт жаша 


© 2hr 


Р") 


At 


2 
= Dirh) g "O (11.27) 


We can eliminate the error term by multiplying Eq. (11.26) by г? and 
subtracting it. from Eq. (11.27). The result would be 


(11.28) 


This would give a better estimate of f(x) as we have eliminated the 
error term A. For r = 2, Eq. (11.28) becomes 


f(x - 2h) - Bf(x - h) -Sf(x + А) - fix + 2$) 
à 12h 


Note that this is a five-point central difference formula which contains 
error only in the order of h*. We can repeat this process further to 
eliminate the error term containing A* and so on. 

One of the most common choices of r is 0.5. Letting г = 1/2, Eq. (11.28) 
becomes 


fo (11.29) 


fix — Вос h/2) + BfGx + h/2) - fic h) 
7 6h 
Note that the use of this formula depends on the availability of function 


values at x + h/2 points. This will be a restriction when Richardson's 
extrapolation technique is applied to tabulated functions, 


FG (11.30) 


Show that, using the data given below, Richardson's extrapolation tech- 
nique can provide better estimates for derivatives. 
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r 7 i T Sa iar z 
jos [-es[-oz] o loss | os Газ | 10 | 12 


Fix) = е | 0.6065 0-7788 [1.0000 11.2840 [1.5467 [2.1170 277168 3420 


T 
| 18 | 


Sam) 


Let us estimate f'(x) at x = 0.5 and assume h = 0.5 and r = 1/2, Then, 
using three-point central formula, we have 
_ /01.0)-/(0› 


Dth) = 00.5) = -2x05 = 17183 


f(0.75)- 500.25) _ 


Dirh) = D(0.25) = д 
0.5 


1.666 


Therefore, 
60 - 0.25(1.718% 
f 0.5) = ——————— E 


= 1.6486 


Note that the correct answer is 1.6487. The result is much better than 
the results obtained using three-point formula with = 0.5 and h = 0.25. 

Now, let us take r = 2. Again using the same three-point central 
formula, 


f(L5) - f(-0,5) 
(2)(0.5)(2) 


Dirh) = Du.0) = = 1.9376 


This shows that the estimate with г = 1/2 is better than the estimate 
with r = 2. 


[16] SUMMARY 


In this chapter, we have seen how numerical differentiation techniques 
may be used to obtain the derivative of continuous as well as tabulated 
functions, We have used forward, backward and central difference quo- 
tients to obtain derivative equations. We have also seen how Richardson 
extrapolation is used to improve the estimates of numerical solutions. 
The discussions in this chapter bring out the following points: 
* If we are given n + 1 data points equally spaced, the interpolating 
polynomial will be of order л, and the nth derivative will be the 
highest that can be obtained. 
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e Better approximation of derivatives can be achieved by using more 
points in the formula. 

For a given number of data points, the central difference formula is 
more accurate than their forward or backward counter parts. 
Roundoff error grows when à gets small. We will always face the 
step-size dilemma. One way to overcome this problem is to use a 
formula of higher order so that a large value of A will produce the 
desired accuracy. 

The problem becomes more pronounced when working with experi- 
mental data which contain not only roundoff errors but also mea- 
surement errors. In such cases, we should first fit a curve to the 
data by using least-squares technique and compute derivatives for 
the curve. 


Backward difference derivative Five-point formula 

, Backward difference quotient Forward difference derivative 
Central difference derivative Forward difference quotient 
Central difference quotient Richardson extrapolation 
Difference tables Two-point formula 
Extrapolation Three-point formula 


What is numerical differentiation? 

/7 Why do we need to use numerical techniques to obtain the esti- 

mates of function derivatives? 

3. What are the three primitive numerical differentiation formulae? 
Compare their truncation errors. 

4. What is three-point formula? How is it different from the two-point 
formula? Ilustrate the difference using geometric interpretations. 

5. Derive the five-point central difference formula 


_ -fix +2h)+8f(x  h) Bf (a - h) + f(x - 2h) 
3 12h 


Also estimate the order of truncation error. 
6. Describe the effect of step size h on 
(a) truncation error, 
(b) roundoff error, and 
(с) total error. 
7, Using Taylor's expansion, derive a formula for computing second 
derivative of a function. 
8. Derive a three-point difference formula for estimating the first de- 
rivative of a tabulated function. 


Ff 
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9. 


10. 


Derive a formula to estimate the second derivative of a tabulated 
function. 

What is Richardson extrapolation? How does it improve the esti- 
mates of derivatives? 


. Estimate the first derivative of f(x) = In x at x = 1 using the first 


order. 
(a) first-order forward difference formula, 
(b) first-order backward difference formula, and 
(c) second-order central difference formula. 
Compare the results with the exact valite 1. 


. Estimate the first derivative of f(x) = In z using the five-point 


central difference formula. How does the result compare with the 
results obtained in Exercise 1. 

Compute the approximate first derivatives of f(x) = cos q at x = 0.75 
radians at increasing values of h from 0.01 to 0.05 with a step size 
of 0.005 (using four decimal digit). Analyze the variations ef error 
in each step. 


. Apply the three-point central difference formula to obtain estimates 


of the first derivatives of the following functions at x = 1 with 
A = 0.01. Compare the results with true values. 

(a) eosh x 

(b) exp(x) sin x 

(с) In Q +x") 

(d) 3542x241 


1 
irm 


. For each of the following functions 


(а) свх х=15 
(b) exp(x/2) х=2 


ө тат ==1 


estimate the size of A that will minimize total error when using the 
three-point central difference formula. 


. Estimate the first derivatives of the functions given in Exercise 5 


at the indicated points using the optimum size h obtained. 


. Use the three-point formula to estimate the second derivatives of 


the functions given in Exercise 4 at x = 0.5 with А = 0.01. 


. Given below the table of function values of f (x) = sin h(x). Estimate 


the second derivatives of f(x) at x = 1.2, 1.3 and 1.4 using a suitable 
formula. 


Numerical Differentiation 367 


x 11 


13 


13 


[3 ] 


fœ | 13356 


1.5095 


1.6983 1.9043 


| 
| 
Js 


| 2.1293 


9. Current through a capacitor is given by 


dv i 
=C = Cv) 


where v(t) is the voltage across the capacitor at time t and C is the 
capacitance value of the capacitor. Estimate the current through 
the capacitor at ¢ = 0.5 using the two-point forward formula with a 
step size Л = 0.2. Assume the following: 


ud) = (t + 0. 
C=2F 


1) e* volts 


10. Using the function in Exercise 8, estimate the first derivative at 
x = 1.3 with A = 0,1 using the three-point centre formula. Compute 
an improved estimate using Richardson extrapolation, Exact value 
of f'(x) = cosh(1.3) = 1.9709. 

11. Evaluate the first derivative at x = —3 and x = 0 of the following 


table function: 


x | -8 Ta f[-1] o[ 1] 2 3 
у | си [шз | o | 3 2 | эз | 


12, Compute the first derivative for the following table 


x = 0.75, 1.00 and 1.25. Use A = 0.05 and 0.1. 


of data at 


x 0.5 0.7 


оэ | aa [аз 


1.5 


E t 1.64 


178 | 189 | 1.96 


1.00 


| 


Compare the results with A = 0.05 and А = 0.1. Comment on the 


differences, if any. 


13. The following table gives the velocity of an object at various points 


in time 
pee ЕЧ si 
| Time (seconds) | 1 | 12 | 16 | 18 | 22 | 24| 28| 30| 
Velocity (m/sec) | 90 | 95 wz) 110| 13.2 | 14.7 | 18.7 | 22.0 | 
| a be 3 


suitable value for h. 
14. 
given as follows: 


The distances travelled by 


a vehicle at intervals of 


Find the acceleration of the object at T = 2.0 seconds. Assume а 


2 minutes are 


Time (seconds) | 0 | 2 


4 


12 


14 


Distance (km) | 0 |025 1 
| as 


8.5 


11 
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Evaluate the velocity and acceleration of the vehicle at T 
and 13 seconds. 


- Write a program that will read on the values of x and /(x), compute 
approximations to f'(x) and f(x), and output x, f (x), f(x) and fà) 
in four columns. 

. Write a program that will compute the total error at increasing 
values of A at regular steps and then estimate that value of A for 
which the total error is minimum. Assume a formula of your choice, 

. Write a program to evaluate a given function at various points of 
interest and estimate its first and second derivativ 
fied point. 
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ылы с „ш 


NEED AND SCOPE 


Like numerical differentiation, we need to seck the help of numerical 
integration techniques in the following situations: 
1, Functions do not possess closed form solutions. Example: 


fix) =C fe-dt 
i 


2. Closed form solutions exist but these solutions are complex and 
difficult to use for calculations. 

3. Data for variables are available in the form of a table, but no 
mathematical relationship between them is known, as is often the 
case with experimental data. 

We know that a definite integral of the form 


b 
I =| ғо) (12.1) 
a 


can be treated as the area under the curve у = f(x), enclosed between 
the limits x = a and x = b. This is graphically illustrated in Fig. 12.1. The 
problem of integration is then simply reduced to the problem of finding 
the shaded area. 

One simple approach is to plot the function on а graph paper contain- 
ing grids and find the area under the curve using the number grids 
covered under the desired boundaries. The accuracy of this rough esti- 
mate can be improved by using finer grids. 
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i b gem 
Fig. 12.1 Graphical representation of integral of a function 


Although the grid method and other such graphical approaches can pro- 
vide us rough estimates, they are cumbersome and time-consuming and 
the final results are far from satisfactory limits. A better alternative 
approach could be to use a technique that uses simple arithmetic opera- 
tions to compute the area. Such an approach, if necessary, can be easily 
implemented on а computer. This approach is called numerical integra- 
tion or numerical quadrature. Numerical integration techniques are simi- 
lar in spirit to the graphical methods. Both of them use the concept of 
"summation" to find the area. 

Numerical integration methods use an interpolating polynomial p,(x) 
in the place of f(x). Thus 


b b 
T=] Ро) а= f p, (x) ах (12,2) 


We know that the polynomial p,(x) can be easily integrated analytically, 
Equation (12.2) can be expressed in summation form as follows: 


t a 
[pa GO dx ўиш, pa Gi) (12.3) 
Š E 


where @ = xo < x} < «x, = b 
Since p,(x) coincides with f(x) at all the points x, = 0, 1, ... л, we can 
say that, 


f(x) dx = Zwi (x,) (12.4) 


The values x, are called sampling points or integration nodes and the 

constants ш, are called weighting coefficients or simply weights. 
Equation (12.4) provides the basic integration formula that will be 

extensively used in this chapter. Note that the interpolation polynomial 
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Pp, (x) was used only to derive the formula (12.4) and will not be used in 
the computation directly. Only the actual function values at sample 
points are used in numerical computation. 

There are various methods of selecting the location and number of 
sampling points. There is a set of methods known as Newton-Cotes rules 
in which the sampling points are equally spaced. Another set of methods 
called Gauss-Legendre rules, uses sampling points that are not equally 
spaced, but are designed to provide improved accuracy. In this chapter, 
we discuss these two sets of methods in detail. We also discuss a method 
known as Romberg integration that is designed to improve the estimates 
of Newton-Cotes formulae. 

In general, numerical integration methods yield much better results 
compared to the numerical differentiation methods discussed in the pre- 
vious chapter. This is due to the fact that the errors introduced in sepa- 
rate subintervals tend to cancel each other. However, the estimates are 
still approximate and, therefore, we also consider the magnitude of er- 
rors in each of the methods discussed here. 


В] NEWTON-COTES METHODS 


Newton-Cotes formula is the most popular and widely used numerical 
integration formula. It forms the basis for a number of numerical inte- 
gration methods known as Newton-Cotes methods. 

The derivation of Newton-Cotes formula is based on polynomial in- 
terpolation, As pointed our earlier, an nth degree polynomial p,(x) that 
interpolates the values of f(x) at п + 1 evenly spaced points can be used 
to replace the integrand / (x) of the integral 


b 
1= [fG)dx 


and the resultant formula is called (n + 1) point Newton-Cotes formula 
If the limits of integration a and 6 are in the set of interpolating points 
x, i = 0, 1, ... n, then the formula is referred to as closed form. If the 
points а and 6 lie beyond the set of interpolating points, then the formula 
is termed open form. Since open form formula is not used for definite 
integration, we consider hore only the closed form methods. They in- 
clude: 


1. Trapezoidal rule (two-point formula) 
2. Simpson's 1/3 rule (three-point formula) 
3. Simpson's 3/8 rule (four-point formula) 
4. Boole's rule (five-point formula) 


All these rules can be formulated using either Newton or Lagrange 
interpolation polynomial for approximating the function f(x). We use 
here the Newton-Gregory forward formula (Eq. (9.20)) which is given 
below: 
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2 3 
pile) = fo + Afas + у se- D4 SÍ is) 6-214 
27) T eT. T, 12.5) 
where 
s = (x — xo)/h 
and 
Аах, 


BEES] TRAPEZOIDAL RULE vA (ШЕ 
и =. 


The trapezoidal rule is the first and the simplest of the Newton-Cotes 
formulae. Since it is a two-point formula, it uses the first order interpo- 
lation polynomial p,(x) for approximating the function f(x) and assumes 
xo = а and x, = b. This is illustrated in Fig. 12.2. According to Eq, (12.5), 
р(х) consists of the first two terms T, and T}. Therefore, the integral for 
trapezoidal rule is given by 


(| 


ҚБ) +f (a) 
M (ba) 


ха x b х 
Fig. 12.2 Representation of trapezoidal rule 
b 
1, = (To + Ti)dx 


b b 
= [Т dx |T, dx 9 14 + la 
P 


а 


Since 7, are expressed in terms of ѕ, we need to use the following trans- 
formation: 


and h=b-a 
a —xpVh = 0 
-x/h =1 


At 
At 
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Then, 
b 1 
In = Í To dx - [АР а= Мо 
а ° 


à н 
1 = {Ту de=[Afy shds=h А, 
1 


а 


Therefore, 


1, Z н мА) 


Since fy = f(a) and f, = f(b), we have 


ES 
|^ п ПА ро v FD | (126) 


Note that the area is the product of width of the segment (b — a) and 
average height of the points fla) and fib). 
Error Analysis 


Since only the first two terms of eq. (12.5) are used for I, the term T; 
becomes the remainder and, therefore, the Lruncation error in trapezoi- 
dal rule is given by 


£*(0,)1 


b 
Е, |Т» de= — > sts - DA-ds 

а 2 о 

Г"(Ө.)һү з “Т LO), 

72 з 2% 12 
Since dx/ds = h, 

б) = h? f’(8), 
we obtain 


(12.7) 


where a < 6, « b 


BH nus 


Evaluate the integral 


è 
I-|G3 +1)йх 
а 
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for the intervals (a) (1, 2) and (b) (1, 1.5) 
Also estimate truncation error in each case and compare the results 


with the exact answer. 


If (a) + f (5) 


-l(s.9)-55 


ю 


h? 
E, == ~; 
1841 5 тту iol 


Г") = 6x 
max|/ "(x)| = f”(2) = 12 
DEM 
‘Therefore, 
3 en 
lEn] 55 ["@=1 


Y 
True error. = Г, 14,4 = 0.75 r^ d 
Note that the error bound is an overestimate of the true error. 
Case b a=1,b=15 
һ=05 


1, 25 1/01) + F(.5)] = 1.59375 


з 
18,1 = gar f" 0.5) = 0.09375 


Тыла = 1.515625 
True error = 0.078125 


Composite Trapezoidal Rule 

If the range to be integrated is large, the trapezoidal rule can be im- 
proved by dividing the interval (a, 6) into a number of small intervals 
and applying the rule discussed above to each of these subintervals. ‘The 
sum of areas of all the subintervals is the integral of the interval (c, b). 
This is known as composite or multisegment approach. This is illustrated 
in Fig. 12.3. 
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халх 1=0,1,..п-1 
Fig. 12.3 Multisegment trapezoidal rule 


As seen in Fig. 12.3, there are n + 1 equally spaced sampling points 
that create л segments of equal width A given by 


b-a 
no 


h= 


xocatih  izsÜü,l..n 
From Eq. (12.6), area of the subinterval with the nodes x,_, and x, is 
given by 


= foyer de = 3 (fo, o) 


The total area of all the л segments is 


T $A 2 (f, + fil 


= È (fig «fel + Ж tfe) «fol 


+È afi, +f 


Denoting f; = f (x,) and regrouping the terms, we get 


nl 

[^ +25 “| (12.8) 

Equation (12.8) is the general form of trapezoidal rule and is known 

as composite trapezoidal rule. Equation (12.8) can also be expressed as 
follows: 
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= (plays fo +h "fla i 
[21 


Similarly, we can estimate the error in the composite trapezoidal rule 
by adding the errors of individual segments. Thus 


= yp. 
Es =~ Ty pu (8,) (12.9) 


We know that f ¢¢(q;) is the second derivative at q;, x,_, < q, < x, If the 
maximum absolute value of the second derivative in the interval (a, b) is 
F, then we can say that the truncation error is 


(12.10) 


Theoretically, we can say that it is always possible to increase accuracy 
by taking more and more segments. Unfortunately, this does not happen 
always. When the number of segments/ncreases, error due to rounding 
off increases. 


Compute the integral 


1 
Г ede 
i 


using composite trapezoidal rule for (a) n = 2 and (b) n = 4. 


Case a 
Ё al 
hz 2 If (a) + РЛ + У Ра +1) 
íi 
-i lexp (-1) + ехр(1)! + ехр(0) 
= 2.54308 
Case b n=4 
_b-a 


h 205 
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+ lexp(—1) + exp(1)] + lexp(—0.5) + ехр(0) + exp(0.5)] 0.5 


= 2.39917 
Note that Т = 2.35040 and n = 4 gives better results. 


Program ТРАРЕТ 


Trapezoidal rule is a simple algorithm and can be implemented by a few 
FORTRAN statements as shown in the program TRAPE1, Note that the 
major computation is done by just one statement in a DO loop. This 
statement calls a function subprogram to evaluate the given function at 
a specified value of x. We can use this Program to integrate any function 
by simply changing the function definition statement 


Fel EXP (-X/2.0) 


Main program + 


* This program integrates a given function 

* using the trapezoidal rule . 
A E eran be кайненен c Esau ^ 
* Functions invoked * 
. Р . 
* Subroutines used " 
* NID a 
* Variables used ы 
+ А - Lower limit of integration Ы 
* B - Upper limit of integration * 
* H - Segment width * 
ii N - Number of segments * 
E E 


ICT - Value of integral 


WRITE(*, *) ‘Give initial value of x’ 
READ(*, *) A 

WRITE(*, *) ‘Give final value of x’ 
READ(*, *) B 
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WRITE(*, *) ‘What is the segment width?‘ 
READ(*, *) H 
N = (B-A)/H 
SUM = (F(A) + F(B))/2.0 
DO 10 I = 1, N-1 
SUM = SUM + F(A+I*H) 
10 CONTINUE 


ICT = SUM * H 


WRITE(*, *) 
WRITE(*, *) 'INTEGRATION BETWEEN', A,' AND', B 


WRITE(*, *) 

WRITE(*, *) ‘WHEN Н =’, Н, * IS’, ICT 
WRITE(*, *) 

STOP 


REAL FUNCTION F{X) 
REAL X 


Р = 1-EXP(-X/2.0) 


E onem mme mn ---- End of function F(X) 


Test Run Results Test run results shown below give the value of inte- 
gration of the equation 


fe) =1-г*% 
from 0.0 to 10.0. 


Give initial value of x 


0.0 

Give final value of X 
10.0 

What is the segment width? 
0.5 


INTEGRATION BETWEEN .0000000 AND 10.0000000 
WHEN H - 5.000000E-001 IS 8.0031400 


Stop - Program terminated. 


Numerical integration 379 


Hae SIMPSON'S 1/3 RULE 


Another popular method is Simpson's 1/3 rule, Here, the function f (x)is 
approximated by a second-order polynomial pax) which passes through 
three sampling points as shown in Fig. 12.4. The three points include 
the end points а and b and a midpoint between them, i.e., x) = a, x; = Б 
and x, = (a + 6)/2. The width of the segments A is given by 


b-a 


2 


һ= 


X-a x „=й x 


Fig. 12.4 Representation of Simpson's Three-point rule 


The integral for Simpson's 1/3 rule is obtained by integrating the first 
three terms of equation (12.5), i.e., 


b b 
1,4 7 [pa (x) de (То + Ty + T2) dx 
a a 


b b b 
= [Ty dz |T; dx + [Ta dx 


= La + Ia + аз 
where 


n 
In = [fo dx 
а 


b 
Таз = J Afos dx 


baz 
Lui] A s(s- 1) dr 
E 
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We know that dx = A x ds and s varies from 0 to 2 (when x varies from a 
to b). Thus, 


Las јана = 24, 


2 
Ing = [Mosh ds = 2hAf; 
a 


2 
tas [228 
И 


sis - Dh ds- Af, 


Therefore, 


2 
= а (12.11) 


ы= {ж +24 +3 


Since Mf, = f, ~ fy and A? fy = fy — 2f, + fos equation (12.11) becomes 


p = ot ah +] = È ire) + m ron] (12.12) 


This equation is called Simpson's 1/3 rule. Equation (12.12) can also be 
expressed as 

Fla)+4f(x;)+ f(b) 
a) $e 


This shows that the area is given by the product of total width of the 
segments and weighted average of heights fla), fixy and f(b). 


Ia 


Error Analysis 


Since we have used only the first three terms of Eq. (12.5), the trunca- 
tion error is given by 


è 
En 7 [Ts de 
а 


"| 2 
a $2 jats- Us -2)h de 
0 


Since the third-order error term turns out to be zero, we have to con- 
sider the next higher term for the error. Therefore, 


b 
Жш 7 fT, de 
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(4) 2 
=? Oe) [sts D(s- 2) (s- 9h ds 
° 


Кыныр 115? a 


24 BE uu €; El 


Af*(0,) 
90 
Since f4(0,) = A* f (8), we obtain 


5 
[Ж =-4 fOO) (12.13) 


>ш "| 


pson's 1/3 rule is exact 


— mapapus: NUR 


Evaluate the following integrals using Simpson's 1/3 rule 


1 А 
(а) Је dx (b) f. sin x dx 
o 


Therefore, 


(Note that J,; gives better estimate than /,, when n = 2. This is because 
14 uses quadratic equation while /,, uses a linear one) 
Case (b) 


a/2 
T= [sini dx = 1/4 
0 
=F LAO + 4 f(A) + f (2) 


= 0.2617993(0 + 3.3635857 + 1) 
423841 
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Composite Simpson's 1/3 rule 

Similar to the composite trapezoidal rule, we can construct a composite 
Simpson's 1/3 rule to improve the accuracy of the estimate of the area. 
Here again, the integration interval is divided into n number of seg- 
ments of equal width, where n is an even number, Then the step size is 


As usual, x; = a + ih, i = 0, 1, ... n. Now, we can apply Eq..(12.12) to each 
of the n/2 pairs of segments o or subintervals (xo; _ 9, xg _ 1), (Хы -p Хш). 
This gives 


La 


nt 
= Иба 2) tfen) + fora 
i 


= E AG + af, + Ж + Afa 2 аж + FON 


On regrouping terms, we get 


' = ETA ez z here] (12:14) 


An analysis similar to the error analysis of composite trapezoidal rule 
can be performed to obtain the error due to truncation in composite 
Simpson's 1/3 rule. 
[I 
s nF = ——— 12.15) 
Els E] 180п P. ^ 


where F is the maximum absolute value of the fourth derivative of f(x) 
in the interval (a, 6), 


_шш 


Compute the integral 


А 


|} )4х 


applying — 's 1/3 rule for п = 4 and п = 6 with an accuracy to five 
decimal places. 


The composite Simpson's 1/3 rule is given by 


лал 
i = К x + Ў fx, -42 P ifi 


Fornz4,h-nl8 
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There аге five sampling points given by x, = Ёл/8, k = 0, 1,..., 4. Substi 
tuting the values of x, in the composite rule, we get, 


а = dE UO) + fGf2) + А/В) + 40808) + (ЯН 


= er [0 + 1.0 + 4(0.61861 + 0.96119) + 2 (0.84090)] 


= 1.17823 
Forn = 6, hz л/12 
There are seven sampling points given by x, = kx/12, k = 0, 1j 54$. 
Substituting these values in the above equation, we get 


1а gg + 1.0 + 4(0.50874 + 0.84090 + 0.98282) + 200.70711 + 0.93060) 


= 1.18728 


Program SIMS1 


Program SIMS1 integrates a given function using the composite Simpson's 
1/3 rule. Note that, unlike TRAPE1 which requests for segment width h, 
SIMS1 requests for number of segments n. Remember, n should bo even. 
This program also uses a function subprogram which can be easily 
replaced for any other function without modifying the main program 
" 


* 


PROGRAM SIMSI 


назара аа i E E EEE EENE * 
Main program * 
This program integrates a given function + 
using the Simpson's 1/3 rule bi 
* 


* Variables used $ 
* A - Lower limit of integration = 
* B - Upper limit of integration + 
* H - Segment width 1 
* N - Number of segments 4 
* ICS - Value of the integral i 

+ 

. 


* Constants used 
d NIL 
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INTEGER N,M 
REAL A,B,H, SUM, ICS, X, F1,F2,F3 
EXTERNAL F 
WRITE (*,*) ‘Initial value of x 
READ (*,*) A 
WRITE(*,*) ‘Final value of x' 
READ(*,*) B 
WRITE(*,*) ‘Number of segments (EVEN number)’ 
READ(*,*) N 
Н = (B-A)/N 
0.0 
F(X) 
= 1M 
P(X+H) 
F(X+2*H) 
SUM = SUM + Fl + 4*P2 + F3 
Fl = F3 
X= X + 2*H 
10 CONTINUE 
TCS = SUM * H/3.0 
WRITE(*,4) 
WRITE(*,*) INTEGRAL FROM’, A, * TO', B 
WRITE(*,*) 
WRITE(*,*) ‘WHEN H = мд. o IS', ics 
WRTTE(*,*) 
STOP 
END 


----Fnd of main SIMSI 


Function subprogram F(X) e 


REAL FUNCTION Р(Х} 
REAL X 


Е = 1-EXP(-X/2.0) 


RETURN 


End of function F(x} ---.-____ 
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Test Run Results Output of the program for integrating the function 
Ро) =1-e* 
from 0.0 to 10.0 is given below: 


Initial value of x 


0.0 

Final value of x 

10.0 

Number of segments (EVEN number) 

20 

TNTEGRAL FROM -0000000 то 10.0000000 
WHEN H = 5.000000£-001 Is 8.0134330 


Stop - Program terminated. 


Bs SIMPSON'S 3/8 RULE 


Simpson's 1/3 rule was derived using three sampling points that fit a 
quadratic equation. We can extend this approach to incorporate four 
sampling points so that the rule can be exact for f(x) of degree 3. 
Remember, even Simpson's 1/3 rule, although it is based on three points, 
is third-order accurate. However, a formula based on four points can be 
used even when the number of segments is odd. 

By using the first four terms of Eq. (12.5) and applying the same 
procedure followed in the previous case, we can show that 


En 


= Ia) + З/\х,) + Bf tay) + F (12.16) 


|, 
L 


where А = (b - a)/3. This equation is known as Simpson's 3/8 rule. This 
is also known as Newton's three-eighths rule. 

Similarly, we can show that, using the fifth term of Eq. (12.5), the 
truncation error of Simpson's 3/8 rule is 


f - 
| BAS Lus Ф -а)5 a 
[Er " 7807“ (94)=-— О жй... | (12.17) 


vhere a < 8, < b. 
“or a given interval (a, b), the (runcation error of Simpson's 1/3 rule is 


5 
Е-ге) 


‘his shows that the 3/8 rule is slightly more accurate than the 1/3 rule. 
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2n ____ _ 
Use Simpson's 3/8 rule to evaluate 


2 EE 
(a) [а +104 (b) fysin@)dr 
1 o 


Basic Simpson’s 3/8 rule is based on four sampling points and, therefore, 
n=3. 


ГЖ ЗА fia) + SfGs) + ЗРО) + F) 


ху=а+л = 1+ 173 = 4/3 
х= а + 2h = 1+ 2/8 = 5/3 
on substitution of these values, we obtain 
Jaa i LA) + (2) + 304/8) + 315/3) 
= 4.75 


Note that the answer is exact. This is expected because Simpson's 3/8 
rule is supposed to be exact for cubic polynomials. 
Case (b) 


ча 
17. | ¥sin(x) dx 
a 
Here again, n = 3 and the integral is given by 
Lo = SÈ {fad + 3f) + 3f + (00) 


х=а+А = 


“ „Ж 
Xg=a+2h= 3 
on substitution of these values, we obtain 


Toa = тє [O + 3/8) + /‹л/З) + Fast) 


= 16 10 + 212132 + 2.79181 + 1.0] 
= 1.16104 
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Bei HIGHER ORDER RULES 


"There is no limit to the number of sampling points that could be incorpo- 
rated in the derivation of Newton-Cotes rule. For instance, we can use a 
five-point rule to fit exactly the function f(x) of degree 9 and so on. Since 
the repeated use of lower-order rules provide sufficient accuracy of the 
estimates, higher-order methods are rarely used. 

One more rule which is sometimes used is Boole's rule based on five 
sampling points. This is given by 


Д = A (fg + Зр, + 12р, + 32/, + Thy) | (12.18) 


where Л = (b – а)/4 
The truncation error of Boole's rule is 


| = BÀT reg.) (12.19) 
Ew =~ ogg f° 02 


Use Boole’s five-point formula to compute 


E 
| sin) dx 
5 


and compare the results with those obtained in previous examples. 


f= (2/8) = 0.61861 
fa= finia) = 0.84090 
fy = (3/8) = 0.96119 
fa= Ка) = 1.0 
ns E 10 + 32(0.61861 4- 0.96119) + 12(0.84090) + 7(1.0)] 
= 1.18062 
The table below shows the results of 


РЕ 
[Ума dx 
0 


obtained by various Newton-Cotes rules. 
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Rue — — n Result _ 
Trapezoidal (simple) | i 0.78540 
Trapezoidal (composite) 2 1.05314 
Simpson’s 1/3 (simple) 2 1.14238 
Simpson's 3/8 3 1.16104 
Simpson's 1/3 (composite) 4 1.17823 
Boole's rule 4 1.18062 
Simpson's 1/3 (composite) 6 1.18728 
Simpson's 1/3 (composite) 12 1.19429 


The estimate can be further improved by using still more intervals. 
Table 12.1 lists the basic Newton-Cotes rules. 


Table 12.1 Basic Newton-Cotes rules 


Name Intervals (n) Formula Error 
Trpeoidal 1 А+) -А fro 
9 o 1 12 
5 
Simpson's 1/3 2 4 (fo + 4f, +f) reo 
5 
Simpson's 38 З Gs a+ +h) peo 


^ h " BAT 
Boole’s rule 4 z (fa 39f, +12}; +32, + 10 eS l OO 


lis] ROMBERG INTEGRATION 


lt is clear from the discussions we had so far that the accuracy of a 
numerical integration process can be improved in two ways: 

1. By increasing the number of subintervals (i.e. by decreasing h)—this 
decreases the magnitude of error terms. Here, the order of the 
method is fixed. 

2, By using higher-order methods—this eliminates the lower-order 
error terms. Here, the order of the method is varied and, therefore, 
this method is known as variable-order approach. 

The variable-order method can be implemented using Richardson's 
extrapolation technique discussed in the previous chapter. As we know, 
this technique involves combining two estimates of a given order to 
obtain a third estimate of higher order. The method that incorporates 
this process (i.e. Richardson's extrapolation) to the trapezoidal rule is 
called Romberg integration. 

According to the Euler-Maclaurin formula, the error expansion for 
trapezoidal rule approximation to a definite integral is of the form 


b 
jf (дах - TCh) = ash? + a,h* + aghf +... (12.20) 
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Th) is the trapezoidal approximation with step size = (b — аул = A. 
Let us define 
Tih, 0) = Th) 


to indicate that T(h) is the trapezoidal rule with no Richardson's 
extrapolation being applied (zero level extrapolation). Thus, Eq. (12.20) 
can be written as 


T= Tih, 0) + ah? + a4h* + agh? +... (12.21) 


Let us have another estimate with step size = (b – a)/2n = h/2 (at zero 
level extrapolation) as 


ы Ss pa UN pg O8 kg 
I-T(h/2,0)- 1 A +тв” +32" +... (12.22) 


By multiplying Eq. (12.22) by 4 and then subtracting Eq. (12.21) from 
the resultant equation, we obtain (after rearranging terms), 


_ 47(h/2,0)-Tth, 0) 
ge 4-1 
= ТМО, 1) + bh + boh? +... (12.23) 


+byh4 +bah +... 


where 
4T(A/2,0) - ТОА, 0) 
3 

is the corrected trapezoidal formula using Richardson's extrapolation 
technique "once" (level 1). Note that its truncation error is of the order 
A‘, instead of h? which is the order in the “uncorrected” trapezoidal 
formula. 

Now, we can apply Richardson's extrapolation technique once more to 
Еч. (12.23) to eliminate the error term containing h*, The result would 
be 


ТА, 1) = 


_ 3674/4, D — T(h/2, 1) " 


1 16-1 


Ceh? + 
= Т(АЛА, 2) + Ceh? +... а2.24) 
where 


16Т(М4, 1)- Т(А/2, 1) 


м. = 
TUA, 2) 18-1 


is the estimate, refined again by applying Richardson's extrapolation a 
second time (level 2). Similarly, we can obtain an estimate with third- 
level correction as 


64T(h/8, 2) - T(h/4,2) 


A/8,3) = 
incus 64-1 
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The entire process of repeated use of Richardson's extrapolation tech- 
nique can be represented in general form as 


o LTZ, ]-1)—Т(М/2!-\ 
Tra р Т1 | (12.25) 


where i = 0, 1, 2 ... denotes the depth of division and j x i denotes the 
level of improvement. 
We can further simplify the notation of Eq. (12.25) by defining 


Ry = ТА", j) 


R,- 3106 4-17 ТРЕ! 
47-1 (12.26) 


Equation (12.26) is GUN as Romberg integration formula. Note that 
this equation, when expanded, will form a lower-diagonal matrix. The 
elements of the matrix R are computed row by row in the order indi- 
cated in Fig, 12.5. The circled numbers indicate the order of computa- 
tions and the arrows indicate the dependencies of elements. An element. 
at the head end depends on the element at the tail end. 


Thus, we have 


- F3) 


Fig. 12.5 Order of calculations for Romberg integration 


Elements in the first column represent trapezoidal rule at h, h/2, МА, 
etc. They can be evaluated recursively as follows: 


h=b-a 


R0, 0) = i (а) + f(5)] 


Ru, 0) £0 


aa 
+h, p fxa-) foi-12..| (227 
=1 
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where 
h; = (b-ay2 
хъ=а+ kh; 
Equation (12.27) is known as recursive trapezoidal rule. 


——  __ 


Compute Romberg estimate Roo for 


2 
[Uxdx 
i 


First we apply the basic trapezoidal rule PT obtain R00) 
RO, 0) = E Fia) + f(b) 


Sta + V2) = 0.75 


Now, we obtain R(1, 0) and R(2, 0) using equation (12.27) 


ва,® - 20 +, fp 
„йө iii. 
+925 1-1. =0.7083388 


R(2, 0) =—у— + half) + а) 


= d «Aras + fam) 
= 0.6970237 
Now, Romberg approximations can be obtained using Eq. (12.26). 


4R(1,0) - RO, 
RO, D= (1, 5 R(0,0) 


_ 4(0.7083333) - 0.75 
j 3 


4R(2, 0) - R(1,0) 
3 


_ 4(0.6970237) — 0.7083333 
лл лаш 


16R(2, 1)- ROD 
15 


= 0.6944444 


R(2, 1) = 


= 0.6932538 


(2, 2) = 
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_ 16(0.6932538) — 0.6944444 


15 = 0.6931744 
Correct answer = 7,(2) = 0.6931471 
Error = 0.0000273 
Program ROMBRG 


Computer algorithm for implementing Romberg integration is simple 
and straight-forward. Starting from the element R(0, 0), all the other 
elements are calculated row by row. The elements in the first column 
are calculate using the recursive trapezoidal rule (Eq. (12.27)) and the 
remaining clements are calculated using the Romberg integration for- 
mula (Eq. (12.26)). The process is terminated when two diagonal ele- 
ments R(i — 1, j — 1) and Ri, j) agree to the required level of accuracy. 

Program ROMBRG implements the Steps involved in Romberg inte- 
gration. 


PROGRAM ROMBRG 


* aia a a ti ree ase = = Ё. 


* Main prog: am 
ы This program performs Romberg integra 
by bisecting the intervals N times 


^ 
* Functions invoked 
*  F,ABS 


* E 
* Variables used * 
* A - Starting point of the interval * 
ha B - End point of the interval Ы 
Ф Н - Width of the interval 
з N - Number of times bisection is done * 
d M - Number of trapezoids + 
* R - Matrix of Romberg integral values * 
" s * 
* Constants used ^ 
*  EPS - Error bound ^ 


REAL A,B,EPS, H, R, SUM, F, X, ABS 
INTEGER N,M 

INTRINSIC ABS 

EXTERNAL F 

PARAMETER( EPS = 0.00001 ) 
DIMENSION R(10,i0) 
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WRITE(*,*) ‘Input endpoints of the interval’ 
READ(*,*) А,В ' 

WRITE(*,*) ‘Input maximum number of times’ 
WRITE(*,*) ‘the subintervals are bisected’ 


READ(*,*) М 
* Compute using entire interval as one trapezoid 
Н = B-A 
R(1,1) = H * (F(A) + FIB))/2.0 
WRITE(*, *) 


WRITE(*,*) R(1,i 
DO 30 І = 2, NH 


» Determine number of trapezoids for I th refinement 
M - 2**(1-2) 

* Reduce step size for I th refinement 
H = H/2 


* Use recursive trapezoidal rule for M strips 
SUM = 0.0 
DO 10 К = 1,M 
X = A+(2*K-1)°H 
SUM = SUM + F(X) 
R(I,1) = R(1-1,1)/2.0+H*SUM 
10 CONTINUE 


^ Compute Richardson's improvements 
DO 20 L = 2,I 
R(I,L)- В(Т,1 
20 CONTINUE 


* Write the results of improvements for I th refinement 
WRITE(*,*) (R(LLL),L = 1,1) 


* (RII, L-I) - RUI-1, L-1) )/ (4** (5-1) -1) 


* Test for desired accuracy 
IF(ABS(R(I-1,1-1) - R(T,I)) .LE. EPS) THEN 
Е Stop further refinement 
WRITE(*, *) 
WRITE(*,*) ‘ROMBERG INTEGRATION -', R(I,1) 
WRITE (*, *) 
Go TO 40 
ENDIF 


* Continue with the refinement process 
30 CONTINUE 


* write the final result 
WRITE(*,*) 
WRITE(*,^) 'ROMBERG INTEGRATION = ', R(N+1,N+1) 
WRITE(*,*) '(Exit from loop)’ 


WRITEl*,* 


REAL FUNCTION Р(Х) 


REAL X 

Р = 1.0/Х 

RETURN 

END 
* --------------- End of function F(X) --------------- * 
Test Run Results 


First run 
Input endpoints of the interval 
12 
Input maximum number of times 
the subintervals are bisected. 


7,500000E-001 
7.083334E-001 6.944445E-001 


ROMBERG INTEGRATION - 6.944445E-001 
(Exit from loop) 


Second run 
Input endpoints of the interval 
12 
Input maximum number of times 
the subintervals are bisected 
2 


7.500000E -001 
7.083334E - 001 6.944445E -001 
6.970239Е -001 6.932541E -001 6.931747Е-001 


ROMBERG INTEGRATION = 6.931747E-001 


(Exit from loop) 


p GAUSSIAN INTEGRATION 


We have discussed so far a set of rules based on the Newton-Cotes 
formula. Recall that the Newton-Cotes formula was derived by integrat- 
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ing the Newton-Gregory forward difference interpolating polynomial. 
Consequently, all the rules were based on evenly faced sampling points 
(function values) within the range of integral. 

Gauss integration is based on the concept that the accuracy of nu- 
merical integration can be improved by choosing the sampling points 
wisely, rather than on the basis of equal spacing. For example, consider 
a simple trapezoidal rule as shown in Fig. 12.6(a). Here, the end points 
of the integral lie on the function curve. Now, consider Fig. 12.6(b). 
Here, the straight line has been moved up such that area B = A +С. 
Notice that the sampling points are moved away from the end points. 
The function yalues at the end points are not used in computation. 
Rather, function values /(x,) and f(x.) are used to compute the shaded 
area. It is clear that the area obtained from Fig. 12.6(b) would be much 
closer to the actual area compared to the shaded area in Fig. 12.6(a), 
The problem is to compute the values of x, and x, given the values а and 
b and to choose appropriate “weights” w, and wz. The method of imple- 
menting the strategy of finding appropriate values of x, and ш, and 
obtaining the integral of f(x) is called the Gaussian integration or quadra- 
ture. 


то) 


fb) Gaussian rule 


Fig. 12.6 Gaussian integration 
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Gauss integration assumes an approximation of the form 
1 n 
1, = [fla)dx= $w; fla) (12.28) 
a 11 


Equation (12.28) contains 2л unknowns to be determined. These un- 
knowns can be determined using the condition given in the integration 
formula (12.28). This should give the exact value of the integral for 
polynomials of as high a degree as possible. 

Let us find the Gaussian quadrature formula for n = 2. In this case, 
we need to find the values of шу, Ws, x, and x, Let us assume that the 
integral will be exact up to cubic polynomials. This implies that the 
functions 1, x, х2 and х? can be numerically integrated to obtain exact 
results 


1 
w + шу = Ј9х=2 
шуху + шуу = [xdx =0 


| 
их} + wer} = fx? de=2 
i 


А 
jz ах=0 
| 


шух) + Wy 


Solving these simultaneous equations, we obtain 


wy = ш =1 
1 


x= —0.5113502 


xy = -L = 0.5773502 


48 


Thus, we have the Gaussian quadrature formula for n = 2 as 
у 
Гео) ах = (08) + r(u48) (12.29) 
à 


This formula will give correct value for integral of f(x) in the range (-1, 
1) for any function up to third-order. Equation (12.29) is also known as 
Gauss-Legendre formula. Two-point Gauss quadrature is illustrated in 
Fig. 12.7. 
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n 
T= | exp(x)dx 
EI 


= fix) + f) 
where x, and x; are Gaussian quadrature points and are given by 


x, 2--l. =—0.5773602 


43 


+L = 0.5773502 


V3 


X 


Therefore, 
Izexp(-0.5773502) + exp(0.5773502) 
= 0.5613839 + 1.7813122 
= 2,3426961 


Changing Limits of Integration 


Note that the Gaussian formula imposes a restriction on the limits of 
integration to be from –1 to 1. This restriction can be overcome by using 
the technique of "interval transformation" used in calculus, Let 


b 1 
[fco dx = С [ gle) dz 
а a 
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Assume the following transformation between x and the new variable z. 


x=Az+B 
This must satisfy the following conditions: 
At 
х=а, z--l andr-b, z=1 
That is 
В-А=а 
A*B-b 
Then 
ast andi pat? 
Therefore 
b-a a+b 
ee aso 


This implies that 


b-a 


Ges 


Then the integral becomes 


The Gaussian formula for this integration is 


b-a (b. 
2 


where ш, and г, are the weights and quadrature points for the integra- 
tion domain (-1, 1) 


ШШ:  _ 


Compute the integral 


1 - 
Јасаг = ;2 Swe) 
л isl 


2 
I= je-*?àx 
E 


using Gaussian two-point formula. 


п= 2 and therefore 
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"" 
1,73 3^ шша) + sg 


Ь+а , 
д" 
Therefore, 
i gu) se * =e 
For a two-point formula 
Wy =We=1 
1 
2; aT 
1 
z= 
© уз 


‘Upon substitution of these values, we get 
1, = 2 exp -1//3) + exp(/ A3) 
= 4,6853922 


Higher-Order Gaussian Formulae 

By using a procedure similar to the one applied in deriving two-point 
formula, we can obtain the parameters w, and г, for higher-order ver- 
sions of Gaussian quadrature. These parameters for formulae up to an 
order of six are Labulated in Table 12.2. 


Table 12.2 Parameters for Gaussian integration 


n i ш, а 
2 1 1.00000 -0.57735 

2 1.00000 0.57735 
3 1 0.55556 -0.77460 

2 0.88889 0.00000 

3 0.55556 -0.77460 — 
4 1 0.34785 -0.86114 

2 0.65215 -0.33998 

3 0.65215 + 0.33998 

4 0.34785 0.86114 
5 1 0.23693 -0.90618 

2 0.47863 -0.53847 

3 0.56889 0.00000 

4 0.47863 0.53847 

5 0.23693 0.90618 


(Contd.) 
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Table 12.2 (Contd.) 


п i wi а 
1 0.17132 -0.93247 
2 0.36076 -0.66121 
3 0.46791 -0.23862 
4 0.46791 0.23862 
5 0.36076 0.66121 
6 0.17132 0.93247 
gan En sl nl | 


B Шш 


Use Gauss-Legendre three-point formula to evaluate 


4 
f(x4+Ddx 
2 

Given n= 3, a = 2, and b = 2. Hence 


b-a d 
т," Ente) 


= шугу) + wogtz;) + wagizs) 


_@-а) bea 
RAS de зыр *3 


"Therefore, 
giz) = (2+ 3) +1 
For n = 3, we have 
w, = 0.55556 2, = –0.77460 
Wy = 0.88889 22=0.0 
ws = 0.55556 zy = 0.77460 
Then 
1, = 0.55556 [(-0.77460 + 3)* + 1] 
+0.88889 [(0+3)* + 1] 
+0.55556[(0.77460 + 3)* + 1] 
= 14.18140 + 72.88898 + 113.33105 
= 200.40143 
We can verify the answer with analytical solution which is 200.4. Note 
that three-point Gauss formula should give exact answer for a second 
order polynomial. The difference in the answer is due to roundoff errors. 
Roundoff error can be minimised by increasing the precision of Gaussian 
parameters. 
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Algorithm 12.1 gives а simple procedure for implementing Gauss- 


Legendre formula. 
Gaussian Integration 


Define the function, fix) 
. Obtain integration limits (a, b) 
„ Decide number of interpolating points (л) 
. Read the Gaussian parameters (w, z) 
. Compute x using 


зом ә 


x 


6. Compute hy 


7. Write result 


Algorithm 12.1 


J| SUMMARY 


In this chapter, we discussed the integration of definite integrals using 
numerical integration techniques. The following Newton-Cotes methods 
were considered in detail: 

* Trapezoidal rule 

+ Simpson's 1/3 rule 

• Simpson's 3/8 rule 

* Boole’s rule 

We also presented a method known as Romberg integration to im- 
prove the accuracy of the results of the trapezoidal method. 

We finally discussed another approach known as Gauss integration 
which is based on the concept that the accuracy can be improved by 
choosing the sampling points wisely, rather than equally. 

FORTRAN programs were presented for the following methods: 

* Trapezoidal rule 

* Simpson's 1/3 rule 

e Romberg integration 


Key Terms 


Newton's three-eighths rule 
Newton-Cotes formula 


(Contd.) 
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Composite approach 
Composite Simpson's 1/3 rule 
Composite trapezoidal rule 
Extrapolation 

Gaussian integration 
Gaussian quadrature 
Gauss-Legendre formula 
Gauss-Legendre rules 
Integration nodes 


Lagrange interpolation polynomial 


Multisegment approach 


Newton interpolation polynomial 


Newton-Cotes rules 
Numerical integration 
Numerical quadrature 
Open form 

Recursive trapezoidal rule 
Richardson's extrapolation 
Romberg integration 
Romberg integration formula 
Simpson's 1/3 rule 
Simpson's 3/8 rule 
Trapezoidal rule 
Variable-order approach 


What is numerical integration? 
еп do we need to use a numerical method instead of analytical 
"method for integration? 
З. Numerical integration is similar in spirit to the graphical method 
of finding area under the curve. Explain, 
Explain the basic principle used in Newton-Cotes methods. 
@ Describe the trapezoidal method of computing integrals. 
.! What is composite trapezoidal rule? When do we use it? 

7, In composite trapezoidal rule, the crror is estimated to be inversely 
proportional to the square of number of segments. That is, we can 
decrease the error by taking more and more segments. But this 
does not happen always. Why? Explain. 

8. Describe Simpson’s method of computing integrals, 

9. Prepare a flow chart for implementing the trapezoidal rule. 

10. Prepare a flow chart for implementing Simpson's one-third rule. 

11. Show that Simpson’s one-third rule is exact up to degree 3, 

12. Derive Simpson's three-eighth's rule using the first four terms of 
Newton-Gregory forward formula. 

13. State formulae and errcr terms for the four basic Newton-Cotes 
rules. 

14. How could we improve the accuracy of a numerical integration 
process? 

15. What is Romberg integration? How does it improve the accuracy of 
integration? 

16. Explain the concept used in Gaussian quadrature. 
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Лебане analytically the following integrals: 


2 

(а) [(3x? «2x -5) d 
о 
2 

O) |(3х% + 202 - рах 
в 


1 
(©) [(3cosx +5) dx 
P o 


XY Evaluate the integrals in Exercise 1 using the basic singlo segment 
trapezoidal rule. 
3, Evaluate the integrals in Exercise 1 using the basic Simpson’s 1/3 
rule, 
4. Evaluate the integrals in Exercise 1 using the basic Simpson's 3/8 
rule. 
+8 Evaluate the integrals in Exercise 1 using multiple application of 
the following rules with n = 4. 
(a) Trapezoidal rule 
(b) Simpson's 1/2 rule 
(c) Simpson's 3/8 rule 
Use the trapezoidal rule with n = 4 to estimate 


1 
dx 
Ere 
correct to five decimal places. 
A Use Simpson’s method with n = 4 to estimate 
pile 
о 1+x? 


correct to five decimal places. Compare this with the result ob- 
tained in Exercise 6. How do they compare with the correct answer 


Ps 
A 0.785398. 


8.) Estimate the following integrals by (a) trapezoidal method and (b) 
^—^ Simpson's 1/3 method using the given n: 


3 ч 
(a | %, n=2,4,8 
1* 


Ope жыз 
iS 
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5 
(о) fe dx, n=8 
1 


mom n-6 
a 


(e) Í з соз? x dx, n=6 
a 


2 


2 
(D jle? Ddx, n=8 


0 
We table below shows the temperature f(t) as a function of time. 


- 
Timet — з га = [а Те ре] 
Temperature, f| 81 | 75 | 80 | 83 | тв | 70 60 | 


(a) Use Simpson’s 1/3 method to estimate 


> 


3 
J fae 
1 


(b) Use the result in (a) tp estimate the average temperature, 
10. Use Romberg integration to evaluate 


E 


w [tar 
1 x 
25 

(d) [(5*2sinx)dx 
0 


2 
(e) | te=- Ddx 
° 
11. Prove that if f” (x) > 0 and a < х < b, the value of the integral 


n 
Гах 
а 
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by the trapezoidal rule will always be greater than the exact value 
of the integral. Verify your conclusion for the following functions: 
(a) f(z) = 2° 
(b f(z) =L+x 427 
‘The table helow shows the speed of a car at various intervals of 
time. Find the distance travelled by the car at the end of 2 hours 


‘Time, hr 0 T 05 | 10 15 T 20 25 
Speed, km/hr | 0 | 40 | 60 50 | 45 65 
L 1 1 
с 
13. The velocity of a particle is governed by the law 
sin(t) 
AD = Tt exp 


If the initial position of the particle is x(0) = 0, then estimate the 
position x(2) using the integral 


n 
x(t) = [0947 
0 


by applying a suitable Newton-Cotes formula. 
14. Estimate the integral 


ie For 
= | exp) —; |d: 
1 23) х 
by Gauss qudrature, with п = 2, 3, and 4. 
15. Evaluate the integral 
ni? 


= f (1 -0.25 еіп)! dx 
о 


using Gaussian quadrature, Assume a suitable value of n. 
16. In an electric circuit, the voltage across the capacitor is given by 


1 Е 
ЫТ) == | ile) dt volts 
[4 0 


T 
E f sin? (4) volts 
0 


Assuming C = 5F, compute the value of voltage v(T) for Pete 
and 3 seconds. 
17. The circumference of an cllipse is given by 


шейге Jess 
a? 


Circumference - 
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18. 


where 2a is the length of major axis and 2 is the length of minor 
axis. Find the circumference if a = 30 metres and b = 20 metres, 
The viscous resistance of an object moving through a fluid with 
velocity v is given by 
R=- 


The velocity is decreasing with time t. The time taken for the veloc- 
ity to decrease from v, to v, is given by 


v 


m 
у: dv seconds 


vj 
T= J Jg duseconds = 


where m is the mass of the object. Estimate the time T required for 
an object with m = 30 kg to reach a velocity of 10 m/sec from an 
initia) velocity of 20 m/sec. 


Write a modular program TRAPEZ that uses the following subpro- 
grams as modules to evaluate integrals using the composite trap- 
ezoidal rules. 

(a) Input module (Module1) 

(b) Evaluation module (Module2) 

(с) Output module (Module3) 
Use a function subprogram to evaluate the given function, 
Develop а modular program SIMS?2 (similar to TRAPE2) to evalu- 
ate integrals using multiple application (i.e. composite) of Simp 
son’s 3/8 rule. 


. Modify the program SIMPS1 to include a test to determine whether 


п is even and stop the execution if n is odd. 


. Write a program that uses two-point Gaussian quadrature to esti- 


mate the integral 


b 
[fid 


Split the interval into n equal subintervals and apply the quadra- 
ture rule to each subinterval. 


‚ Show that the integral 


ajr? ad, n=1,2,.. 
0 


can be evaluated recursively by 
ai=- у вә 
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Write a program to evaluate Г.о using this recursive formula anc 
compare the results by Simpson’s rule. 

. Write a program to evaluate the integral of a table of points using 
Newton's three-eighth’s rule. 

. Write a program to experiment with the integration problem 


I=] eost) dz 
0 


to see if you can determine an optimum value for /t using Simpson's 
rule, 


Note: The trapejoidal rule and the Simpson’s 1/3 rule can be used to 
evaluate the integral of a table of points. Development of 
FORTRAN programs is left as an exercise to the readers. (C pro- 
grams for evaluating the integral of tabulated data are given in the 
Appendix D). 


Numerical Solution of 
Ordinary Differential 
Equations 


————M 


е NEED AND SCOPE 


Many of the laws in physics, chemistry, engineering, biology and eco- 
nomics are based on empirical observations that describe changes in the 
states of systems. Mathernatical models that describe the state of such 
systems are often expressed in terms of not only certain system param- 
eters but also their derivatives Such mathematical models, which use 
differential calculus to express relationship between variables, are known 
as differential equations 

Examples of differential equations are many. A few of them are listed 
below to illustrate the nature of differential equations that occur in 
science and engineering. 
1, Law of cooling 
The Newton's law of cooling states that the rate of loss of heat from a 
liquid is proportional to the difference of temperatures between the liq- 
uid and the surroundings. This сап be stated in mathematical form as 


STU) atr. qu (103.1) 
di 


where T, is the temperature of surroundings, 710) is the temperature of 
liquid at time £ and Ё is the constant of proportionality. 

2. Law of motion 

The law governing the velocity ví?) of a moving body is given by 
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dult) 
a T (13.2) 
where m is the mass of the body and F is the force acting on it. 


3. Kirchhoffs law for an electric circuit 
‘The voltage across an electric circuit containing an inductance /, and a 
resistance R is given by 


di, 
Lg Y (13.3) 
4. Radioactive decay 


The radioactive decay of an element is given by 


ur -Ет = 0 (13.4) 


where m is the mass, t is time and Ё is the constant rate of decay. 


5. Simple harmonic motion 
The equation to describe a simple harmonic motion is given by 


d'y ду 5 
mud tage 9-9 (13.5) 


where y denotes displacement and m is the mass. Note that d*y/ar 
represents acceleration and dy/di represents velocity of the moving weight. 
6. Force on a moving boat 

When a boat moves through water, the retarding force is proportional to 
the square of the velocity, The acceleration is given by 


(13.6) 


where ул is the mass and № is the drag coefficient. 


7. Heat flow in a rectangular plate 
The model for heat flow in a rectangle plate that is heated is given by 


32 32 
Zu uo 


ёх? dy? 


where u(x, y) denotes the temperature at point (x, y) and fix, y) is the 
heat source, 

Note that all these examples contain the rate of change of a variable 
expressed as a function of variables and parameters. Although most of 
the differential equations may be solved analytically in their simplest 
form, analytical techniques fail when the models are modified to take 
into account the effect of other conditions of real-life situations, In all 
such cases, numerical approximation of the solution may be considered 
ав a possible approach. 


fix. у) (13.7) 
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The main concern of this chapter is to present various methods of 
numerical solution of a class of differential equations known as ordinary 
differential equations. This implies that the differential equations may 
appear in different forms. This is evident from *he ex: mples. We, there- 
fore, define some important classes of differeuual equations before con- 
sidering the numerical solution of ordinary differential equations. 


Number of Independent Variables 


The quantity being differentiated is called the dependent variable and 
the quantity with respect to which the dependent variable is differenti- 
ated is called independent variable. If there is only one independent. 
variable, the equation is called an ordinary differential equation. If it 
contains two or more independent variables, the derivatives will be par- 
tial and, therefore, the equation is called a partial differential equation. 
Eqs. (13.1) to (13.6) belong to the class of ordinary differential equations 
while Eq. (13.7) is a partial differential equation. In this chapter we 
shall consider only the ordinary differential equations. 


Order of Equations VON vd 


Order of Equations , 
Differential equations are also classified according Uo their order. The 


order of a differential equation is the highest derivative that appears in 
the equation. When the equation contains only a first derivative, it is 
called a first-order differential equation. For example, Eqs (13.1) to (13.4) 
are first-order equations. On the other hand, if the highest derivative is 
a second derivative, the equation is called a second-order differential 
equation. For example, Eq. (13.5) is a second-order equation. 

A first-order equation can be expressed in the form 


д = fix, у) (18.8) 


A second-order equation can be expressed in the form 
yr = fi, у, уз (13.9) 


where y” denotes the second derivative and y’ is the first derivative. 
Higher-order equations can be reduced to a set of first-order equations 
by suitable transformations. For example, the equation 


= fa, њу A} 
can be equivalently represented by 


ч' = fix, y, u) 


Degree otequations ON UD 


Sometimes, the equations аге reforred to by their degree. The degree of 
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a differential equation is the power of the highest-order derivative. For 
example, 
xy” + yy’ = 2у +8 
is a first-degree, second-order equation while, 
("P + 5у = 0 
is a second-degree, third-order equation. 


linear and Nonlinear Equations (7 № Јо 


A differential equation is known as а linear equation when it does not 
contain terms involving the products of the dependent variable or its 
derivatives, For example, 


у" + 3у' = 2у +х? 
is a second-order, linear equation. The equations 
у" + 6-1 
y =-ау? 
are nonlinear because the first one contains a product of у' and the 
second contains a product of y. 


General and Particular Solutions 


A solution to a differential equation is a relationship between the de- 
pendent and independent variables that satisfy the differential equa- 
tion, For example, 


у= Зх +x 
ів the solution of 
у'=бх+1 
Similarly, ' 
у=." 
is the solution of 
y'=y ` 


Note that each of the solutions given above is only one of an infinite 
number of solutions. For example, 


у= 30 +х+2 
у= Зх? +x-10 


are also solutions of y' = бх + 1. In general, у = бх + 1 has a solution of 
the form 


у= За кх жс 


where с is known ав the constant of integration. Similarly, у = у has a 
solution of the form у = ае", The solution that contains arbitrary con- 
i i is therefore known as the general solution. 
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If the values of the constants are known, then, on substitution о: 
these values in the general solution, a unique solution known as particu- 
lar solution can be obtained. 


Initial Value Problems 4 — qwe— 


order to obtain the values of the integration constants, we need addi- 
tional information. For example, consider the solution у = ae* to the 
equation у’ = y. If we are given a value of y for some x, the constant a 
can be determined. Suppose y = 1 at x = 0, then, 


30) = ае? 
Therefore, 
azl 
and the particular solution is 
yee 


If the order of the equation is n, we will have to obtain n constants and, 
therefore, we need n conditions in order to abtain a unique solution 
When all the conditions are specified at a particular value of the inde- 
pendent. variable x, then the problem is called an initial-value problem, 

It is also possible to specify the conditions at different values of the 
independent variable. Such problems are called the boundary-value prob- 
‘ems. Por example, if, instead of specifying only y(0) = 1, we also specify 


x0) + y(1) = 2, then the problem will be а boundary-value problem. In 

this сизе, T M 
yO) + (1) =а(1 «e)-2 

giving ee ен: 


21 +e) 
One-step and Muliistep Methods 


All numerical techniques for solving differential equations involve a se- 
ries of estimates of yix) starting from the given conditions. There are two 
basic approaches that could be used to estimate the values of yx). They 
are known as one-step methods and multistep methods. 

in one-step methods, we use information from only one preceding 
point, i.e. to estimate the value у, we need the conditions at the previ- 
ous point y, ; only. Multistep methods use information at two or more 
previous steps to estimate a value. 


Scope 
Tn this chapter, we mainly concentrate on the solution of ordinary differ- 
entíal equations and discuss the following methods: 

1. Taylor series method 

2. Euler's method 

3. Heun's method 
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4. Polygon method 

5, Runge-Kutta method 

6. Milne-Simpson method 

7, Adams-Bashforth-Moulton method 
The initial-value problem of an ordinary first-order differential equation 
has the form 


Гу = Лх, убх)), y) = Уш (13,10) 
ee 250) 


In this chapter, we determine the solution of this equation on a finite 
interval (xy, b), starting with the initial point xy. For the sake of simplic- 
ity, in a number of places, we use f for f (x, КОСА and y? for y^). 


Esai TAYLOR SERIES METHOD 


We can expand a function у(х) about a point x = хо using Taylor's theo- 
rem of expansion 


2 Y" o) 
yx) = уб) + (x — xo) у' (ху) + (x = xo)? E EL 


"m e (1321) 


where у (хо) is the ith derivative of у(х), evaluated at x = хо. The value of. 
у(х) can be obtained if we know the values of its derivatives. This im- 
plies that if we are given the equation 

y'zfiy) (13.12) 
we must then repeatedly differentiate f (x, y) implicitly with respect to x 
and evaluate them at ху. 
For example, if y ' = f(x, y) then 


„s-a (dx) a 


dx 


d 
"Efe ne Ses уй у 
_of of _ 
ae ape = + Рх Ру (13.13) 


where / denotes the function f(x, у) and f, and f, denote the partial 
derivatives of the function f(x, y) with respect to x and y, respectively. 
Similarly, we can obtain 

y" =fat ffy Phe TI (13.14) 
Let us illustrate this through an example. 
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Consider the equation 
у'=лх®+у? 
under the condition у(х) = 1 when x = 0, 


at x = 0, (0) = 1 and, therefore, 
y(0)=1 
"000 = 2 
У"'(0) = 2 + (201002) + (31) = 8 
Substituting these values, the Taylor series becomes 


yos lexexte Baty. (13.15) 


The number of terms to be used depends on the accuracy of the solution 
needed. 


wr? 
BH. — Md EN 
Use the Taylor method to solve the equation 


PET 
for x = 0.25 and x = 0.5 given y(0) = 1 


The solution of this equation is given by Eq. (13.15). That is, 


эд=1+х eat ква + 
Therefore, 
(0.25) 0.25 + (0.25 + H (0.25) +... 

= 1.33333 

Similarly, 
2005) = 1+ 0.5 + 0.5? + 8 (05° +... 

= 1.81667 

Improving Accuracy 


The error in Taylor method is in the order of (x – х0)" * 1. If |x - xol is 
large, the error can also become large. Therefore, the result of this 
method in the interval (rọ, Б) when (b — xp) is large, is often found 
unsatisfactory. 
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The accuracy can be improved by dividing the entire interval into 
subintervals e 23), (ху, X2), (x, 25), n of equal length and computing 
y), i = 1, 2, ..., n successively, using the Taylor series expansion. Here, 
Убх) is used as an initial condition for computing y(x;,,). Thus, 


) 
жб) = ya) + У-Н” wy cag a? 


(x, 
T! 
yim) ( 


) 
ad РЕ (13.16) 


If we denote the size of each subinterval as h, then, 
xih  forizQ 1l. n-1 
and Eq. (13.16) becomes 


Jin =N 
L е — 


3.1%) 


The derivatives y/* are determined using Eq. (13.12), (13.13) and (13.14) 
at x = x, and y = у. This formula can be used recursively to obtain y; 
values. 


B — 8 18 


Use the Taylor method recursively to solve the equation 
y=x +y,  yQ- 
for th the interval (0, 0. 4) usi 


rvals of si 


g twi 


‘The derivatives of y are given by 


ysrt.y! ww: 
= 2х +2уу' 4 
" = 2 + Ду) + 2уу" "aa, 0^ 
on 6y’ y+ дуу'" 
Iteration 1 
Ур yo E 
һ= 0.2,» ais = 


у= 700) =0+ m = 
yë = У'0) = 2 х 0 + 2 x у(0) ху(0) = 
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Similarly, 
N =2 
X o 
Therefore, 
э=0+0+0+ S 0.2) +0 
= 0.002667 (atx = 0.2) 
Iteration 2 
x, 50.2 
у = 0.002667 


- yi =x? + y? = (0.2) + (0.002667)? = 0.04 
УГ = 2х, + 2ууу{ 
= 2(0.2) + 2(0.002667) (0.04) 
= 0.400213 
У 224 WyiP + ayy} 
= 2 + 200.04) + 2(0,002667) (0.400213) 
= 2.005335 


yi” =бу yp +2ууу" 
= 6(0.04) (0.400213) + 2(0.002667) (2.005335) 
= 0.106748 


э 2°” 
Ya=yı+yih E TM T h? + a 
= 0.002667 + 0.04(0.2) + 9100213 0.2? 


n oma 


777(0.2)3 + 


0.106748 „4 
BEC a (0.2) 


= "oa 
That is 
(0.4) = 0.021352 
If we use h = (b — хо) = 04 (without subdividing), we obtain 


(0.4) = i (0.49 = 0.021333 


The correct answer to the accuracy shown is У(0.4) = 0.021359. It shows 
that the accuracy has been improved by using subintervals. The accu- 
тасу can be further improved by reducing h further, say, k = 0.1 
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One major problem with the Taylor series method is the evaluation of 
higher-order derivatives. They become very complicated. All these de- 
rivatives must be evaluated at (x;, y), i = 0, 1, 2, .... This method is, 
therefore, generally impractical from a computational point of view. How- 
ever, it illustrates the basic approach to numerical solution of differen- 
tial equations. 


Picard's Method 

Consider the differential equation 
dy _ 
dr b» 


We can integrate this to obtain the solution in the interval (xy, x) 


z А 
[ау = | f(x, y)dx 
Ра žo 


or 
z 
у(х) = ya) + | [(х,у)йх 
E 


Since y appears under the integral sign on the right, the integration 
cannot be formed. The dependent variable y should be replaced by either 
à constant or а function of x. Since we know the initial value of y(at x = 
ху), We may use this as a first appropriation to the solution and the 
result can be used on the right-hand side to obtain the next. appropria- 
tion. The iterative equation is written as 


y =ya+ f f(xy) dx (13.18) 
žo 


Equation (13.18) is known as Picard's method. Since this method in- 
volves actual integration, sometimes it may not be possible to carry out 
the integration. Example 13.3 illustrates the application of Picard’s meth- 
od, 

It can be seen that Picard's method is not convenient for computer- 
based solutions. Like Taylor's series method, this is also a semi-numeric 
method. 


С Ын 0 8 


Solve the following equations by Picard's method 
fi) у(х)=х%®+у%, 90) =0 
(i) у(х) = хе, (0) =0 
and estimate y(0.1), (0.2) and y(1) 
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@ уб) =} + y! 
Yo=9, х5=0 


= yo+ | x2 +(у°)%)4= 


ET 


This process can be continued further although it may be a difficult task. 
If we stop at y’, then 


Er ud 
yx) = 54 Fe 


(0.1) = 000003333 
(0.2) = 0.026667 
У(1) = 0.3492063 
Gi) y Ge) = xe” 
yo70, x= 0 


y? =0+ f xe%dx= 
о 


JP sos] хеч! da etm с 
0 


Note that further integrations will become more difficult and even im- 
possible, Now, let us assume 


ya) = y? =e] 
(0.1) = 0.0050125 
(0.2) = 0.0202013 
У(1) = 0.6487213 
We know that the exact solution of y'(x) = хе? is 


xo =-In(1- =>) 
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Therefore 
»00.1), „а = 0.0050125 


%00.2),,„а = 0.0202027 
IH Vexact = 0.6933147 


Note that the error increases when (х ~ x) increases. Better accuracy 
can be achieved by using the new initial value" i.e. у (0.1) can be used 
as the initial value for computing y(0.2), instead of y(0). 


EES EULER'S МЕ OD 


Euler's method is the simplest one-step method and has a limited appli- 
cation because of its low accuracy. However, it is discussed here as it 
serves as a starting point for all other advanced methods. 

Consider the first two terms of the expansion (13.11) 


у(х) = уху) + у' (xy) (x — xy) 
Given the differential equation 


yx) = fx, y) with у(х) = Yo 
we have 

yx) = ftp, Yo) 
and therefore 


у(х) = yog) + (x — x9) frm, уо) 
Then, the value of у(х) at x = x, is given by 
Уб) = y) + (xy — x9) (ху, Yq) 
Letting A =x, ~ xy, we obtain 
X = Yo +h fixo Yo) 
Similarly, убх) at x = x, is given by 
Y= +h fo ур) 
In general, we obtain a recursive relation as 


(13.19) 


This formula is known as Euler's method and can be used recursively 
to evaluate y,, yy, ... of y(x,), убх,),..., starting from the initial condition 
Yo = Ya). Note that this docs not involve any derivatives. 

A new value of y is estimated using the previous value of y as the 
initial condition. Note that the term А f(x; y,) represents the incremen- 
tal value of y and f(x;, y,) is the slope of у(х) at (x; v), i.e. the new value 
is obtained by extrapolating linearly over the step size h using the slope 
at its previous value. That is 


New value = ald value + slope x step size 
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This is illustrated in Fig. 13.1. Remember that y, approximates y(x,) and 
уз approximates y(x). The difference between them is the crror intro- 


duced by the method. 


4 
yo 
Exact value Actual solution у(х) 
ot f) — 
ac Error in yo 
/ ,7*— Tangent line 
^^ at (x), y) 


MY) ! Error. in y, 


at (Xo. Yo) 


% E] » 


RI 


Fig. 13.1 lilustration of Euler's method for two steps 


H NEN 


Given the equation 


y(1.5) = 2 + 0.5[3(1.0# + 1] = 4.0 
y(2.0) = 4.0 + 0.51301.5) + 1] = 7.875 


Gi) = 0.25 
У) =2 
y(1.25) = 2 + 0.251312 + 1] = 3.0 
У(1.5) = 3 + 0.25(3(1.25? + 1) =Д.42188 
(1.75) = 5.42188 + 0.25[3(1.5) + 1] = 7.35938 ЕЁ 2785 " 


(2.0) = 7.35938 + 0.2513(1.75)? + 1] = 9.90626 2 ТЛ 
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Notice the difference in answers of y(2) in these two cases, The accuracy 
is improved considerably when № is reduced to 0.25 (true answer is 20.0). 


Accuracy of Eulers Method 


As usual, the accuracy is affected by two sources of error, namely, round- 
off error and truncation error. Roundoff :ror is always present in a 
computation and this ean be minimised by increasing the precision of 
calculations. 

The major cause і - loss of. accuracy is Lruncation error. Tuis arises 
because of the use of a truncated Taylor series. Since Euler's method 
uses Taylor series iteratively, the truncation error introduced in ап it- 
eration is propagated to the following iterations. This means the total 
truncation error in any iteration step will consist of two components—the 
propagated truncation error and the truncation error introduced by the 
step itself. 

The truncation introduced by the step itself is known as the local 
truncation error ond the sum of the propagated error and the local error 
is called the globai truncation error. Recall the Taylor series expansion 
used in Euler's method for estimating the values of y, 


Since only the first two terms are used in Euler's formula, the local 
truncation error is given by 


2 3 4 


If the step size А is very small, the higher-order terms may be neglected 
and therefore 


Boiss 


‘The above analysis assumes that the function y (x, y) has continu- 
ous derivatives. The local truncation error of Euler's method is of the 
order A”. If the final estimation requires n steps, the total (global) trun- 
cation error at the target point b will be 


[Bg] = Ech? це + со +... + cM = nch? 
Er 


в = (сү + Ca +. +с„уп 


n=(b-x,Vh, 
| (b — хокй 
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Compute the errors in the estimates of Example (13.4) when h = 0.5 


у= Зх? +1 
yr = Gx 
y"-6 
Step 1 
х= 1, vo = 2 
¥,= (1.5) =4.0 (от example (13.4)) 
2X aa YU рз 
Ea o nth 
= ED oy + $10.55 = 0.875 
Step 2 


ry=15,  y,=4.0 
Уз = (2.0) = 7.875 (from example (13.4)) 


601. 
== 0.5 
z 05 


E, 


ln $0»? = 1.25 


Exact solution is 
yx)2i!4x 
and therefore 
True у(1.5) = 4.875 
True y(2.0) = 10.000 


The estimated and true values of y and corresponding errors are tabu- 
lated below. 


B = е 

= Estimated y | True y E, _| Global error | 
15 40 4.875 0.875 0.875 

ій 2.0 7875 | 10000 1.250 2.125 


Note that the local and global errors are equal at the first step and 
they are different in the second step. The difference between them is the 
propagated truncation error that results from the first step. 

Observe that 


¢,(0.5)* = 0.875 since cy 

000.5)? = 1.250 since cz = 5.0 
c= (ey кє} = 4.25 

Е = cb -xgh = 425 x 1 x 0.5 = 2.125 
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This confirms the error equations for local and global truncation errors 
discussed in this section. 


Program EULER 


The program EULER estimates the s ution of the first order differen- 
tial equation у = /(x, y) at a given point using Euler's method (Eq. 
13.19). The program is simple and self-explanatory. Note the use of an 
intrinsic function — "in the line 

N = INTIXP - X}/(H + 0.5) 
Given initial value of x and the point of solution ХР, this statement 
computes the number of steps required for evaluation using the speci- 


fied step-size +. The function ТКТ returns the nearest integer of the real 
value 


^а аа Ё 
* Main program * 
9 This program estimates the solution of the first * 
* order differential equation у’ = f(x, у) at a íi 
* giver point using Euler's merhod à 
ЗК. ni - ed я * 
* Functions invoked $ 
* Р,1НТ im 
* Subroutines * 
* NIL * 
. —ÓÓ ж ана а ао Ж 


used 
* x Initial value ot 
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WRITE(*,*) 
WRITE(*,*)' SOLUTION БҮ EULERS METHOD’ 
WRITE(*,*) 


* Read values 


WRITE(*,*) ‘Input initial values of x апа y’ 
READ(*,*) X,Y 

WRITE(*,*] ‘Input x ar which Y is required' 
READ(*,*) XP 

WRITE(*,*) ‘Input step-size h' 

READ(*,*) H 


* Compute number of steps required 
М = INT((XP-X) /H«0.5) 
* Compute Y recursively at each step 


DO 10 I = LN 
DY = H'F(X,Y) 
X = хн 
Y = YDY 
WRITE(*,*) I,X,¥ 
10 CONTINUE 
* write the final result 
WRITE(*,*) 
WRITE(*,*) "Value of Y at X BN, dat. т 
WRITE(*, *) 


STOP 


End of main EULER 


Function subprogram 


tees 


REAL FUNCTION F(X,Y) 
REAL X,Y 


F s 2.0 * v/x 


RETURN 
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Test Run Results 
SOLUTION HY El S METHOD 
Input initial values of x and y 
1.0 2.0 
Input x at which y is required 
2.0 
Input step-size t 
0.25 
1 3.0000000 
2 4.2000000 
3 5.6000000 
4 7.2000000 
Value of Y at X = 2.0000000 is 7.2000000 
Stop - Pr te 


[134] HEUN'S METHOD 


Euler’s method is the simplest of all one-step methods. It does nol re- 
quire any differentiation and is easy (0 implement on computers. How- 
ever, its major weakness is large truncation errors. This is due to its 
linear characteristic. Recall that Euler's method uses only the first two 
terms of the Taylor series, In this section, we shall consider an improve- 
ment to Euler's method. 

In Euler's method, the slope at the beginning of the interval is used Lo 
extrapolate у, {о y; , , over the entire interval. Thus, 


Nisi =¥ + myh 
where m, is the slope at (x,, y,). As illustrated in Fig. 13.2, y, , | is clearly 
an underestimate of у(х, , 1). 


y» 


Im 


~ y. (Estimated using m,) 
Tangent at (x), ,) (slope т) 


-—— Ика) 
Ls 
N y. , (Estimated using m) 


yx) —* Ld (Heun's method) 
| ! "у. (Estimated using m,) 
' ' a 
* хл х 


Fig. 13.2 tiustration of Heun’s method 
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Ап alternative is to use the line which is parallel to the tangent at the 
point (æ; ү, (x, , 1)) to extrapolate from y, to y, ‚ ү as shown in Fig. 13.2, 


That is 

Xia ty + m 
where m; is the slope at (x, . |, ух, . 1)). Note that the estimate appears 
to be overestimated. 

A third approach is to use a line whose slope is the average of the 
slopes at the end points of the interval. Then 
m;*ma 
EE h 
As shown in Fig. 13.2, this gives a better approximation to y; , ү. This 
approach is known as Heun's method. 

The formula for implementing Heun's method can be constructed easily. 
Given the equation 


+12 + (13.20) 


yx) = Ах, у) 
ме can obtain 
ту= у(х) f(x») 
т«= y; m fé es Nord 
and therefore 
fi, yo lass Yi 
m= ый 


Equation (13.20) becomes 


(18.21) 


3i +4 "m" 


Note that the term y; „ ү appears on both sides of Eq. (13.21) and, there- 
fore, у, ‚ү cannot be evaluated until the value of y, , ; inside the function 
Fx; „у, Y, 41) is available. This value can be predicted using the Euler's 


formula as 
Yer =H + hxfGs y) (13.22) 


Then, Heun's formula becomes 


Hervey efe se fosa vtl | (13.23) 


Equation (13.23) is an improved version of Euler's method. Since it 
attempts to correct the values of y; , ; using the predicted value of y, , ; 
(by Euler's method), it is classified as a one-step predictor-corrector 
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method, Ед. (13.22) is known as the predictor and Eq. (13.23) is known 
as the corrector. Substituting Eq. (13.22) into Eq. (13.23), we obtain 


у= + à fx, yd for; у, у, + В fos УД) (13.24) 


Given the equation 
у(х) = 2у/х with у(1) = 2 


estimate у(2) using (i) Euler's method, and (ii) Heun's method, using 
A = 0.25, and compare the results with exact answe 


У = flx, у) = 2y/x 
=1, yor h=0.25 
(i) Euler's method 


(1,25) = Yo + he f Gg, уу) 


9 
22035 ni = 3.00 


2х3.0 
(1.5) = 8,0 + 0.25 =43 
жаб) = 8.0 +025 2020.42 
y.76)=4.2 + 0.25 25 22.56 
асо ORES 
3(2.0) = 5.6 + 0,25 ls 7 7.2 
(ii) Heun's method 
Iteration 1 
2x2 
Myers 4.0 
31.25) = 2 + 0,25(4.0) = 3.0 
2»30 
ту= “тб” 48 
»125-24939 40 4g) 31 
Iteration 2 
"UL 
PUE C 


X15) = 3.1 + 0.25(4.96) = 4.34 
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2х 4.34 
Пар "= 


s) = 3.1 + 925 (4.96 + 579) = 444 


48 


Iteration 3 


Ж1Л5) = 4.44 + 025 (5.92 + 6.77) = 6.03 


Иегайоп 4 
2 «6.03 
1.75 
y,(2.0) = 6.03 + 0.25(6.89) = 7.75 


=6.89 


m= 


002.0) = 6.03 + 225 (6.89 + 7.75) = 7.86 


Exaet solution of the equation 
убх) = 2у/х with у = 2 
is obtained as 
убо) = 2x? 
The exact values of у(х) and the estimated values by both the methods 
are tabulated below. 


| à | | ух) К | 

| Euler method | Heun’s method | Analytical 

7 100 : __ 200 | 00 300 | 

[ 135 3.00 | 310 3125 
“150 | 440 зм | 450 

[ 1л 5.60 603. | | 6125 

[ 200 7.20 7.86 | 8.00 


All estimated values are accurate to two decimal places. It is clear that 
Heun's method provides better results compared to Euler's method. 


Error Analysis 


It can be easily shown that Heun’s method is a second-order method 
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and, therefore, its local truncation is of the order Л?. Let us consider 
Eq. (13.24). Letting i = 0, for simplicity, we obtain 


Nyt i (т; + Fly + А, Yo + m, h)] (13.25) 


Let us expand the term f(x, , ,, yg + mh) in a Taylor series form, 


Roh yo + mulo = fG Yo) + MT, mA 


= тү + hf, + mf, 
On substituting this into Eq. (13.25) we get ш 


ж = yn +h, + Iq em) 13.26) 
We know that 
т=у' 
(f mif) = у" (see equation (13.13)) 
and therefore Eq. (13.26) can be written as 


Mayo the Th? 
This proves that Heun's method is of order A? and the local truncation 


error is of the order 4°, If the final estimate is obtained after n itera- 
tions, then the glohal truncation error is given by 


18,1 = X eh? = пеп? 
А 
We know that 


Therefore, 

|E,| = (b - xoch? 
That is, the global truncation error is of the order А2. 
Program HEUN 


The algorithm of Heun's method is implemented by the program HEUN. 
Note that the algorithm is very similar to that of the Euler's method, 
except for the slope used in extrapolating the initial value. 


PROGRAM HEUN 


in pr 
This program solves the first order differential * 
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F,INT 


* 
. 


NIL 


* Variables used 
е X - Initial value of independent variable 


A Y - Initial value of dependent variable 

* XP - Point of solution 

a H - Step-size 

* N - Number of steps 

E cete do аньана TERRE 
* Constants used 

. 


NIL 


REAL Х,Ү,ХР,Н,М1,М2,Р 
INTEGER N, INT 
INTRINSIC INT 
EXTERNAL F 


WRITE (*,*) 
WRITE(*,*)' SOLUTION BY HEUNS METHOD' 
WRITE(*,*) 
* Input values 
WRITE(*,*) ‘Input initial values of x and y* 
READ(*,*) x,v 
WRITE(*,*) ‘Input x at which y is required’ 
READ(*,*) XP 
WRITE(*,*) ‘Input step-size h' 
READ(*,*) H 
* Compute number of steps required 
N = INT((XP-X) /H+0.5) 
* Compute Y recursively at each step 


DO 20 I - i,N 


М1 = F(X,Y) 

M2 F(X*H, Y«MI*H) 
X = XH 

Y = Y«0.5*H* (M1«M2) 


WRITE(*,*) I,X,¥ 
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20 CONTINUE А 
* Write the final result 


WRITE(*,*) 
WRITE(*,*) ‘Value of Y at X s'y Gy 5 “ist, Y 
WRITE(*,*) 


STOP 


End of main WEWN - 


REAL FUNCTION Р(Х,Ү) 


REAL X,Y 

Е = 2,0 * уух 

RETURN 

END 
* -------------- End of function P(X,Y) ---------------* 
Test Run Results 


т 
First run 
SOLUTION BY EEUNS METHOD 


Input initial values of x and y 


1.0 2.0 

Input x at which y :s required 

2.0 \ 

Input step-size h 

0.25 
X 1.2500000 3.1000000 
2 1.5000000 4.4433330 
3 1.7500000 6.0302380 
4 2.0000000 7.8608460 


Value of Y at X - 2.0000000 is 7.8608460 
Stop - Program terminated. 
Second run 
SOLUTION BY HEUNS METHOD 


Input initial vaiues cf x,and y 
1.0 2.0 

Input x at which y is required 
2.0 
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Input step-size h 
0.125 


c зз хал шы 
VERB BHR 


Value of Y at X = 


.1250000 
.2500000 
.3750000 
.5000000 
6250000 
.7500000 
.8750000 
-0000000 


2.0000000 


Stop - Program terminated. 


Third run 


SOLUTION BY HEUNS 


5277780 


&- 
6. 
1. 


is7.9595900 


2 
3.1175920 
3. 
4 
5 


7694530 


.4833640 
.2593310 


0973560 
9974420 
9595900 


METHOD 


Input initial values of x and y 


1.0 


Input x at which y is required 


2.0 


2.0 


Input step-size h 


0.1 


1 1. 
2 1. 
3 ХУ 
4 1 
5 i 
6 1 
T 1. 
В 1. 
9 i. 
10 2. 


Value of Y at X = 


Stop 


1000000 
2000000 
3000000 


.4000000 
.5000000 
-6000000 


7000090 
8000000 
9000000 
0000000 


.4181820 
.8761710 
.3739700 
.9115800 
.4890040 
.1062420 
.7632950 
.4601640 
.1968490 
T. 


9733510 


2.0000000 is 7.9733510 


- Program terminated. 


Eas] POLYGON METHOD 


Another modification of Euler's method is to use the slope of the func- 
tion at the estimated midpoints of (x,, y;) and (х,у, У,.1) to approximate 


Уна. Thus, 


ж + Хуу Fit Yivr 


Yin = 2 


2 


= у, + f (x; + М, у, + AVIA 


\ 


p 


(13.27) 
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Ay is the estimated incremental value of y from y, and can be obtained 
' using Euler's formula as 
dy =h flan y) 
Then, equation (13.27) can be written as 


i+ hf (xp + МЭ, yi + ШӘ Го, yd) 
| hf (x, + h/2, y; + ту) 
=y; + mh 


Vier 


(13.28) 


| 
| 
| 
| 


where 


mih 
2 


msfG,y) and m= x, "m "ur 
Equation (13.28) is known as the modified Euler's method or improved 
polygon method, The method, also called the midpoint method, is illus- 
trated in Fig. 13.3. 


y»! 


| 
| 
| 


X x*h2 л 
Fig. 13.3 Midpoint method 


Like Heun's method, this method is also of the order Һ? and therefore, 
the local truncation error is of the order A? and the global truncation 
error is of the order h”. 


eee 


Estimate y(1.5) with A = 0.25 for the equation in Example 13.6 using 
polygon method, 


n 2y 
y’=flzy)= x 
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y1)220 
y(1.25) = 2.0 + 0.25 f (1 + 0.125, 2 + 0.1257 (1, 2)) 
= 2.0 + 0.25 f (1.125, 2.5) = 3.11 
(0.5) = 3.11 + 0.25 /(1.25 + 0.125, 3.11 + 0.125 /(1.25, 3.11) 
= 3.11 + 0.25 f (1.375, 3.732) = 4.47 


Estimated values of y(1.5) by various methods for the equation 
ух) = 2y/x with y(1) = 2.0 
are given below: 


Euler's method 4.20 
Heun’s method 444 
Polygon method 4.47 
Exact answer H 4.50 


Note that the vili method yields better results compared to the 
Heun's method. 


Program POLYGN 

Program POLYGN implements the polygon algorithm to solve a differ- 
ential equation of type y ' = f(x, у) 

* 


PROGRAM POLYGN 


м solves differential equation x 
* fix, у} by polygon method . 
* ЖЕРИН а s mma 
M ions invoked * 
* 'F,INT * 
* Subroutines used * 
* NIL d 
* Variables used өр 
Ы X- Initial value of the independent variable * 
» Y - Initial value of the dependent variable * 
Ы ХР - Point of solution * 
* Н - Incremental step-size * 
s М - Number of computational steps required * 
н E а pierres pes arusasa Ж 
* Constants used * 
hi NIL * 
* * 
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REAL X,Y,XP,H,M1,M2,F 
INTEGER N,INT 
INTRINSIC INT 
EXTERNAL F 


WRITE( 


v, 


WRITE(*, *)' SOLUTION BY POLYGON METHOD' 
WRITE(*,*) 


* Input values 


WRITE(*,*) ‘Input initial values of x and у' 
READ(*,*) X,¥ 
WRITE(*,*) ‘Input x at which y is required’ 


READ (* 


(7) XP 


WRITE(*,*) ‘Input step-size h’ 


READ(* 


JN) GS 


* Compute number of Steps required 
N = INT((XP-X) /H*0.5) 


* Compute Y 


ar each step 


= 1,К 

F(X,Y) 
F(X«0.5*H,Y40.5**M1) 
X+H 
Y«M2*H 


"uu 


WRITE(*,*) I,X,Y 
30 CONTINUE 


* Write the 


final value of Y 


WRITE(*,*) 
WRITE(^,*) ‘Value of Y at X 3°, X, ' ія‘, Y 
WRITE(*,*) 


STOP 


* Function subprogram a 
* 


REAL FUNCTION F(X,Y) 
REAL X,Y 


Е = 2.0 * v/x 


RETURN 


= End of function F(X,Y) -------------. * 
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Test Run Results 


SOLUTION BY POLYGON METHOD 
Input initial values of x and y 


1.0 2.0 

Input x at which y is required 

2.0 

Input step-size h 

0.25 
1 1.2500000 3.1111110 
2 1.5000000 4.4686870 
3 4.7500000 6.0728310 
4 -2.0000000 7.9235990 


i “of-¥ at X = 2.0000000 is 7.9235990 


Stop Program terminated 


RUNGE-KUTTA NSTHODS 


Runge-Kutta methods refer to a family of one-step methods used for 
numerical solution of initial value problems. They are all based on the 
general form of the extrapolation equation, 


Yi = Y; + Slope x interval size 
=y + тћ 


where m represents the slope that is weighted averages of the slopes at 
various points in the interval ^. If we estimate m using slopes at r points 
in the interval (x,, x,,,), then m can be written as 


т = шту + шута +... + Ww, M, (13.29) 


where ш, Wp... W, are weights of the slopes at various points. The 
slopes ту, m», ..., m, are computed as follows: 


my = fxn y) 
тә = f(x, + ah, у, + by; туд) 
ma = Го, + aah, у, + yy тй + bay mgh) 


m, = FR + az d у + bri, 1 Mih +... + bri, a mah) 
That is 
maFayy) r=1 


ra 
mese hen eh Уһ") r22 (13.30) 
m 
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Y Pot involves the sloj 


at all 
iod recur 


1113.401 


Not» that the computation of 5! 
previous points. Slopes cr 1 
starting from m; = „2,3. 
Runge-Kutta (RK) methods аге know 
an RE method is called the ran 
repo 


ly using equatior 


by thnir vester For instance 3 
ta method s hon slopes at 
are used to co 771 avavene slopa w 


therefore, Euler's method is a first-order Runge-Kutta merioa, oun- 
larly, Heun's method is a second-order Runge-Kutta method because it 
employs slopes at two end points of the interval. 

As it was demonstrated by the Euler and Heun methods, higher the 
order, better would be the accuracy of estimates. Therefore, selection of 


order for using RK methods depends on the problem under consider- 
ation. 


Determination of weiyiis 


For using a Runge-Kutta method, the first requirement is the determi- 
nation of weights of slopes at various points. The number of points is 
equal to the order of the method chosen. For the purpose of demonstra- 
tion, we consider here the second-order Runge-Kutta method and show 
how to evaluate variovs constants and weights. 

The second-order RK method has the form 


Jur m Y; eam, + wy h Gas 


where 

my fx, y) 

ma = f(x; + arh, y; + Буу ту) (13.32) 
The weights шу and ws and the constants a, and уу are to be deter- 
mined. The centre principle of the Runge-Kutta approach is that these 
parameters are chosen such that a power series expansion of the right 
side of Eq. (13.31) agrees with the Taylor series of expansion of Yi in 
terms of y, and fx,, у,). 

The second-order Taylor series expansion of у, д about у; is 


Yin =» +уй + Tht (13.33) 
Given 
yi =f ay у) = 


а, 
at thc 
Therefore, Eq. (13.33) becomes 


Jur =9,+ fhe (hth Pf) a (13.34) 
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Now, consider the right side of Eq. (13.31). Since zn, is already a func- 
tion of x; and y;, we need to expand only m, ав a power series in terms of 
f(x; y). From Eq. (13.32) 
ma = Fx; + ash, y; + буту) 
Expanding the function on the right-hand side using the Taylor series 
expansion, we get 
ma = f(x, y) + ah f, + by mh f, + О(А?) 
Substituting this in Eq. (13.31) and replacing m, = f(x, у,) by f, we get 
Xia = luf + шу + шаһ, + wibyh f [ДА + Oh’) 

= y; + (wy + wahf + (uoa f, + woby f fh? + Oh?) — (13.35) 
If Eqs (13.34) and (13,35) are to be equivalent, then they should agree 
term by term. This is possible only if 


U,*u,zl 
wa, = 12 
wz by, V2 


Note that we have four unknowns and only three equations. Therefore, 
there is no unique solution. However, we can assume a value for one of 
{һе constants and determine the others. This implies that there is an 
infinite family of second-order RK methods. For example, if we choose 
ш, = 1/2, then we get 


wp=12, w= U2, aj-l  bu-l 


With these values, Eq. (13.31) becomes 


(13.36) 
where, 
my = [(х„ y) 
My = f(x, + h, y; + mh) 
Note that this equation is the Heun's formula. 
Similarly, if we choose w, = 0, then we get 
1! wj70, wal, а= 12, by-12 
and Eq. (13.31) becomes 
(13.37) 


where 


m = (х,у) 
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mih 
2 


ње, 


This results in the midpoint or polygon method, 

Another strategy is to choose the parameters such that the bound on 
the truncation error is minimum. It has been shown by Ralston that 
шу = 1/3 and wg = 2/3 produces minimum truncation error. With these 
weights, 


(13.38) 
where, 
m, = f(x, y) 
3 
т, - (s. E * Smh) 
Fourth-Order Runge-Kutta Methods 
It is clear from Eq. (13.29) and the discussions above that it is possible to 


construct RK methods of different orders. However, the commonly used 
ones are the fourth-order methods. Although there are different versions 
of fourth-order RK methods, the most popular method is the classical 
fourth-order Runge-Kutta method given below: 


m= f, y) 


h 
т (а mt) 


^ (13.39) 


та + 


m, = f(x, + hy у, + mah) 


imm e 


Use the classical RK method to estimate y(0.4) when 
Y@)=2+y? with (0) =0 
Assume h = 0.2 


Senet i? EA) 
m = fleo yo = 0 
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точ зан) f(0.1, 0) = 0.01 


h moh f>2 0.01х 91x 0.2 

то Вота > 7. figo. jJ 

т, = f Cta + Yo + mgh) = f(0.2, 0.01 x 0.2) = 0.04 
yiia e ee 0.2 %0-01 + 0-04 9 9 _ 0.002667 

Iteration 2 

1,202 

уз = 0.002667 

m, = f (0.2, 0.002667) = 0.04 


m, = rfo. 3, 0.002967 , 04x02) = 0,090044 


= 0.090136 


se оз, 0.002667+ ози) 


т, =f (0.4, 0.002667 + (0.090136X0.2)) = 0.160428 


у(0.4) = 0.002667 + 0.04 + 2(0.090044) + me 090136) + 0,1 160428 | 
= 0.021360 (correct to six decimals) 


The exact answer is 0.021359. If we use Л = 0.1, then у(0.4) will be 
0.021359. Try and check. 


Program RUNGE4 


RUNGE4 is a program designed to compute the solution of a first order 
differential equation of type y' = f(x, y) using the fourth-order Runge- 
Kutta method. 


* Main program t 
* This program computes the solution of first order * 
* differential equation of type y' = f(x,y) using * 
* the 4th order Runge- Kutta method * 


* Functions invoked 
F,INT 
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* Subroutines used * 
* NIL * 
* variables used ¥ 
» X - Initial value of independent variable * 
* Y - Initial value of dependent variable & 
b ХР - Point of solution * 
* Н - Step-size . 
id N - Number of steps ¥ 
ЫНЫР pa tae aa teens amie E 
* Constants used * 
* NIL . 
Ae TCR * 


REAL X,Y,XP,H,M1,M2,M3,M4,F 

INTEGER N,INT 

INTRINSIC INT 

EXTERNAL F 

WRITE(*,*) 

WRITE(*,*)' SOLUTION BY 4 TH ORDER R-K METHOD‘ 
WRITE(*,*) 


* Input Values 


WRITE(*,*) 'Input initial values of x and y' 
READ(*,*) X,Y 

WRITE(*,*) 'Input x at which y is required' 
READ(*,*) XP 

WRITE(*,*) ‘Input step-size h' 

READ(*,*) H 


* Compute number of steps required 
N = INT((XP-X) /H*0.5) 
* Compute Y at each step 


зоо Г 
WRITE(*,*)' STEP x Y = 
WRITE(*, 4)! О ------------------------------ Ё 


DO 40 1 = 1,N 

Ml = F(X,Y) 

M2 = P(X+0.5*H, Y*0.5*M1*H) 
F(X+0.5*H, Y«0. 5*M2*H) 
F (X+H, ¥+M3*H) 
X = X+H 
Y = ¥+(M1+2.0*M2+2.0*M3+M4) *H/6.0 
WRITE(*,*) I,X,Y 


El 
"n 
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40 CONTINUE 
WRITE(*,*)* ---------------------------------- 


* Write the final value of Y 


WRITE(*,*) 
WRITE(*,*) ‘Value of Y at X =", X,' is’, Y 
WRITE(*,*) 


STOP 


* --------------- End of main RUNGE4 ---------- 


REAL FUNCTION F(X,Y) 


REAL X,Y 
Р = 2.0 * Y/x 
RETURN 
END 
+ --------------- End of function F(X,Y) ------------ * 
Test Run Results 


SOLUTION BY 4_TH ORDER R-K METHOD 
Input initial values of x and y 


1.0 2.0 

Input x at which y is rem 

2.0 

Input step-size h 

0.25 
STEP x Y 
1 1.2500000 3.1246910 
2 1.5000000 4.4993830 
3 1.7500000 6.1240550 
4 2.0000000 7.9986960 


Value of Y at X - 2.000000 7.9986960 


Stop - Program terminated. 


E ACCURACY OF ONE-STEP METHODS 


How do we achieve the desired level of accuracy in one-step methods? 
One approach is to repeat the computations at decreasing values of А 
until the required accuracy is obtained. Th- may involve a large 
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number of repetitions if the initial value of А is far away from the 
optimal value. Another approach is to estimate the value of А that is 
likely to give the desired level of accuracy. 

It is very difficult to have a formula in terms А for the global error. 
However, we know that the order of global truncation error is A" if the 
order of local truncation error is л" * !. We can use this information to 
study the sensitivity of the solution to the value of ^ and thereby esti- 
mate the size of А. Obtain estimates of y(A) at two different values of h, 
say h, and Ag. Then 


Уай 7 УВ, hj) = chy 


Ула У, ha) = chy 
where c is the constant of proportionality. The above equations сап be 
solved for c as B 


(13.40) 


c 


рф, шом) 
ЫЛ 

If we need the answer to an accuracy of d decimal places, then the error 
must not be greater than 0,5 x 10~ and, therefore, 


ch’ «0.5 x 10^ 
Thus 


(h; - hz )10-* 


Ar, = 
24у 


5 (13,41) 


where Ay = y(b, h;) — y(b, hy). For example, for a second-order method, 
the global error is proportional to A? and therefore hop is given by 


СЕНГЕ 


Any value of h < hoot will give the desired accuracy. 


‘Two estimates of y(0.8) of the equation 
уб) =22+5у2 with y0)=1 
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are obtained using fourth-order RK method at А = 0.05 and h = 0.025. 
40.8, 0.05) = 5.8410870 
3(0.8, 0.025) = 5 "479627 
Estimate the value of h required to obtain the solution accurate to 
_(@) four decimal places and (ii) six decimal places 


G) d-4 
4 4 

hy 510 4.26 « 10-8 
h = 0.01437 

äi) d=6 

(0.054 ~0.0254)10-8 _ a 

t 777 9(x0068787) — 420 * 10" 
һ = 0.00454 


We may use й = 0.01 to obtain a figure accurate to four decimal places 
and h = 0.004 to obtain a figure accurate to six decimal places. 


Rez] MULTISTEP METHODS 


So far we have discussed many methods for obtaining numerical solu- 
tion of first-order initial-value problems. All of them use information 
only from the last computed point (x,, y,) to compute the next point (x; , ,, 
Ji „ у). Therefore, all these methods are called single-step methods. They 
do not make use of the information available at the earlier steps, у, . |, 
Yi- a) ete, even when they are available. It is possible to improve the 
efficiency of estimation by using the information at several previous 
points. Methods that use information from more than one previous points 
to compute the next point are called multistep methods. Sometimes, a 
pair of multistep methods are used in conjunction with each other, one 
for predicting the value of y; , , and the other for correcting the predicted 
value of y, ‚ з. Such methods are termed predictor-corrector methods. 
One major problem with multistep methods is that they are not self- 
Starting. They need more information than the initial value condition. If 
a method uses four previous points, say yo, Yı Ya» and ys, then all these 
values must be obtained before the method is actually used. These val- 
ues, known as starting values, can be obtained using any of the single- 
step methods discussed earlier. It is important to note that the degree оў 
accuracy of the single-step method must match that ^f the multistep 
method to be used. For instance, a fourth-order RK me hod is normally 
used to generate starting values for implementing a fourth-order multi- 
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step method. In this section, we consider the following popular multistep 
methods: 

1, Milne-Simpson method 

2. Adams-Bashforth-Moulton method : 
Both of them are fourth-order methods and use a pair of multistep 
methods in conjunction with each other. 


Milne-Simpson Method 


‘The Milne-Simpson method is a predictor-corrector method. It uses a Milne 
formula as a predictor and the popular Simpson's formula as a corrector. 
‘These formulae are based on the fundamental theorem of calculus. 


ye, Day) + [ fi y) dx (13.42) 


a 


When j = i — 3, the Eq. becomes an open integration formula and produc- 
es the Milne's formula 


[rennin Oh a haem (19.43) 


Similarly, when j = i — 1, Eq. (13.42) becomes a closed form integration 
and produces the two-segment Simpson's formula 


ШЫ; Mittit hid (13.44) 


3 


Milne’s formula is used to ‘predict’ the Starting points 
value of y, , , which is then used to cal- Yi-aVinr У 
culate f, , , (in Eq. (13.44) from the dif- 
ferential equation. 


fier =P оа) 

Then, Eq. (13.44) is used to correct the 
predicted value of у, , ;. The process is 
then repeated for the next value of i. 
Each stage involves four basic calcula- 
lions, namely, 

1. prediction of y, , , 

2. evaluation of f, , | 

3. correction of y; , ; 

4. improved value of f; , , (for use in 


Evaluation of f., 1 


Correction of y;, 1 


next stage) 
lt is also possible to use the corrector Next stage 
formula repeatedly to refine the esti- (if necessary) 


mate of y,, т before moving on to the рю 13.4 implementation of pre- 
next stage (see Fig. 13.4). dictor-corrector meth- 
ods 
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 -—tn __ 


Given the equation 
ya with yy=2 


estimate y(2) using the Milne-Simpson predictor-corrector method. 
Assume Л = 0.25. 


Milne's formula at i = 3 is 

Xt syot EOR- poi) 
Simpson's formula at i = 3 is 

э уал +7?) 
where f, = f(x, уу) 

To use these formulae, we require the estimates of Уһ Уз and уз in 
addition to the initial condition yo. These can be obtained using any of 
the single-step fourth-order methods. 

Let us assume that they have been estimated using the fourth-order 


RK method as follows: 
y = y(1.25) = 3.13 


ys 2 У(1.5 ) = 4.50 
Ya = (1.75) = 6.13 


Тһеп 
2х3.13 
laa ei 
2x45 
h is 78:00 
2х6.13 


Substituting these in Milne’s formula we predict the value of y(2) as 


4x 0.25 


p 
= 2.00 
Yi + 3 


(2x 5.01 — 6.00 + 2 x 7.01) = 8.01 


АЕ a 


2 
Now we obtain the corrected value of y(2) using Simpson formula as 


yi =450+ 235 (6.00 + 4 x 7.01 + 8.01) 


=8.00 
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We can again use the corrector formula to refine the estimate 


yf = 4.50 + 028 (6.00 + 4 x 701 +801) 


- 800 
Note that the exact solution is у(2) = 8. 


Program MILSIM 


An algorithm for evaluating the equation y’ = f(x, у) using Milne-Simpson 
method is illustrated in Fig. 13.4. Program MILSIM shows the imple- 
mentation of the algorithm in details. The program does the following: 

1. Computes the starting points using fourth-order RK method 

2. Predicts the function value by Milne's formula 

3. Corrects the value obtained using Simpson's method 

4, Writes the results 


* Main program * 
* — This program г the first order differential * 
* equation y' д Miine-Simpson method * 
* Functions invoked * 
* ¥, INT * 
* Subroutines used * 
* NIL * 
ЭЙ c 2 — + 
* variables used * 
- X(1) - Initial value of independent variable ы 
* Yo) initial value of dependent variable « 
* XP - Point of solution Ы 
* N Numb: steps * 
* н tep-size à 
* X - Array oí independent variable 

* ? dependent variable 


Constants used 
* NIL 


see eee 
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REAL X, Y,H, XP, M1,M2,M3, MÀ, SUM1, SUM2,F 
INTEGER N,INT 

INTRINSIC INT 

EXTERNAL F 

DIMENSION X(10), ¥(10) 


Read values 


А 


WRITE(*,*) ‘Input initial values of x and y' 
READ(*,*) x(1),Y(1) 
WRITE(*,*) ‘Input x at which y is required: 
READ(*,*) ХР 
WRITE(*,*) ‘Input step-size h' 
READ(*,*) H 
Compute number of computations involved 
N = INT((XP-X(1))/H + 0.5) 


We need four starting points for Milne-Simpson method. 
Initial values form the first point. Remaining three 
points are obtained using 4th order RK method 


tee 


WRITE(*,*) 
WRITE(*,*) ‘INITIAL VALUES', X(1), Y(1) 
WRITE(*,*) 


* 


Computing three points by RK method 
WRITE(*,*) “THREE VALUES BY RK METHOD' 


DO 10 І 1,3 
Ml = F(X().YUD) 
M2 = F(X(I)«0.5*H, Y(I)40.5*M1*H) 
M3 = P(X(I)+0.5*H, Y(I)40.5*M2*H) 
M4 = P(X(I)+H, Y(I)«M3*H) 
X(IM) = X(I)«H 


Ү(1+1) Ү(Т)+(М1+2.0*Мм2+2.0*М3+м4)*н/6.0 
WRITE(*, I, Х(ІК1), Ү(ї+1) 
10 CONTINUE 
WRITE(*, *) 
WRITE(*,*) ‘VALUES OBTAINED BY MILNE-SIMPSON METHOD’ 


=4, N 

F(X(I-2), ү(т-2)) 
Р(Х(1-1), Ү(ї-1)) 
F4 = P(X(1), Ү(т)) 


* Predicted value of У (by Milne’s formula) 


Ү(І+1) = Y(I-3)«4.0*H/3.u- \4.0*Е2-Р3+2.0*Р4) 
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X(I+1) = x(t) + H 
FS = FIX(I141), Y(Is1)) 


* Corrected value of Y (by Sim-son's formula) 


Ү(1+1) = Y(T-1) + H/3.0 *(F344.0*F4«F5) 
WRITE(*,*) I, Х(1+1), Y(I:.1) 
20 CONTINU. И 
WRITE(*,*) 
WRITE(*,*)'Value of Y at X =", X(N41)," ist, 
Y(N«1) 
WRITE(*,*) 


STOP 
END 


ж ee 


REAL FUNCTION F(X,Y) 
REAL X,Y 


F = 2.0 * v/x 


RETURN 
END 


End of function F(X,Y)---------...... * 


Test Run Results 
ee 


Input initial values of x and Y 


1.0 2.0 

Input x at which y is required 

2.0 

input step-size h 

0.125 

INITIAL VALUES 1.0000000 2.0000000 

THREE VALUES BY RK METHOD 
1 1.1250000 2.5312380 
2 1.2500000 3.1249770 
a 1.3750000 3.7812150 

VALUES OBTAINED BY MILNE-SIMPSON METHOD 
4 4.4399660 
5 5.2812040 


6 1.7500000 6.1249520 
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7 1.в750000 70311899 

8 2.0000000 7.9999350 
Value of Y at X = 2.0000000 is 7.9999360 
Stop - Program terminated. 


Adams-Bashforth-Moulton Method 


Another popular fourth-order predictor-corrector method is the Adams- 
Bashforth-Moulton multistep method. The predictor formula is known 
as Adams-Bashforth predictor and is given by 


9f, aj (13.45) 


[yam gi BBA ~ БЛ. + 37а 


The corrector formula is known as Adams-Moulton corrector and is giv- 
en by 


К _ _ - 
ent BM mun (13.46) 


This pair of equations сап be implemented using the procedure described 
for Milne-Simpson method. 


Repeat Example 13.10 using Adams-Bashforth-Moulton method. 


By Adams predictor formula 


Xp» E (of, — 39f, + 3T ~ fo) 


= 6.13 + 925 (55 x 7.01 ~ 59 x 6.00 + 37 x 5.01 - 9х4) 


= 8.0146 
2x 8.0146 
= 8.0146 


By Adams corrector formula 
yi =» + eh 5% + 19% + Э//) 


= 6.13 + 9:25 (591 — 5 x 6.00 + 19 x 7.01 +9 х 8.0146) 


24 
= 8.0086 


y‘ (refined) = 8.0079 
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B ACCURACY OF MULTISTEP METHODS 


We know that for each differential equation, there is an optimum step 
size h. If h is too large, accuracy dimi--ishes and if it is too small, round- 
off errors would dominate and reduce the accuracy. 

By computing the predicted and corrected values of y, , ү, we can 
estimate the size end sign of the error. Let us denote the predicted value 


by y/, ү. Similarly, denote the truncation error in predicted value by Ep 
and corrected value by E,.. Then, we have, 

Е„=у-У[д 

Е. =Y- 


where y denotes the exact value of y(x; , |). The difference between the 
error is 


Е-Е. Уб УР (13.47) 


A large difference indicates that the step size is too large. In such cases, 
we must reduce the size of A. 


Milne-Simpson Method 
Both the Milne and Simpson formulae are of order À* and their error 


terms are of order Аб. 
The truncation error in Milne's formula is 
28 
Ep = gg Y (8) HE 
The truncation error in Simpson's formula is 
л 1 Б 
Е. =- 902" CaA" 


If we assume that y(@,) = у®(@Ө„) then 


Е, 

2 =- 

ашы 
ог, 

Ey =- 2ВБ„ 


Substituting this in Eq. (13.47) we obtain, 


Fiabe 
в, = cfi 
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If the answer is required to a precision of d decimal digits then 


E 
Уа 
“Sa 05х10 
= < 05 x 10 
or 
[CREE < 29/2 x 10* = 15 x 10% | (13.48) 
ac enu 
Adams Method 
The truncation error in Adams-Bashforth predictor is 


Е, = 2 yo (o) je 


and the truncation error in Adams-Moulton corrector is 
=-29 49 5 
E, 7-350 У [САД 


Then, assuming y?(8,) = у'®\ 8), we get 


Substituting in Eq. (13.47) results in 


E, =- (г -yt 


For achieving an accuracy of d decimal digits, 


Ea TYP a| «27009 x 0.5 x 10% = 7 x 104 (13.49) 


According to Eqs (13.48) and (13.49), à should be reduced until the 
difference between the corrected and predicted values is within the speci- 
fied limit. It should be noted, however, that if the step length is changed 
during the calculation, it will be necessary to recalculate the starting 
points at the new step value. 
Modifiers 
Using the error estimates, we can modify the estimates of Yi,, before 
proceeding to the next stage. That is 

УУ + Be 
For Milne's method 


маа (У.а) 
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For Adams meth d S 


Yis3 Hie 39 0t -Fha 
BEES 
Solve the differential equation 

у a=- 

for y(2.0) using the Milne-Simpson method with the application of modi- 
fier to the corrector. The first four points are given under 

| n= 
1.0000000 
0.8333333 
0.7142857 
019250000 


0.5102040 
.3906250 
Iteration 1 


y(1.8) = y? = yot D of, -fa + f 


=10+ = xaz (-1.13888889 + 0.5102040 — 0.7812500) 
= 0.5573506 


2 
? = (у?) =-0.3106396 
жа В Р 
Xi =з +( +40 ) 
= 0.7142857 + 92 (— 0.5102040 — 1.56250 — 0.3106396) 


= 0.5553961 


Р = 
Modifier Е, =--+—* = 0,0000674 
29 
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Modified y, = y; + Е, = 0.5554635 
Iteration 2 
2 
fu7 (vi) = -0.3085397 


x20 = yE = у, + 40 of, +95) 
= 0.5008366 


ff = –0.2508373 


Уз уз Анал +72) 
= 0.4999585 
Modifier Е. = 0.0000303 
Modified у; = 0.4999888 


Exact answer = 0.5 
Error = 0.0000112 


-_ _ —_—_—_——_—_——_————___ 
ШЕШ SYSTEMS OF DIFFERENTIAL EQUATIONS 


Mathematical models of many applications involve a system of several 
first-order differential equations. They may be represented as follows: 


4 
um Tf уьу ә yw. Yio) = yi 
Чу; _ m 
ах = I Ya Im Yao) = Yao 
(13.50) 


dy, 
ue = б, Yis Yor Ул), Yao) = Ymo 


These equations should be solved for V10), уз(х), ..., у(х) over interval 
(a, b). 

These equations сап be solved by any of the methods discussed in this 
chapter. At each stage, all the equations are solved before proceeding to 
the next stage. For example, if h = 0.5 and a = хо = 0, then we must 
evaluate y,(0.5), у2(0.5),..., y,,(0.5) before Preceding to the stage A = 1.0. 
Let us consider a system of two equations for the purpose of illustration. 
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Усе Аб, уру, У) = Уо 


Уб) = х, yn yg, Ухо) = Yao 
Assume that we want to use the Heun’s method. The first stage would 
involv. е following calculations. 

mj) = fiio, уз» Уго) 

т\(2) = fixo, Уу Узо) 

т) = Ао + h, ую + Ат), уо + hm,(2)) 

m2) = folxy + h, yyg + Ату), yog + Am, (2) 
_ my (1) + mg (1) 


m(1) 5 


m(2) =m em 


yix) = 101) = у(х) + mOM = Yio + mS 
yox) = ys) = уух)  m(2)h = у + m(2)h 


The next stage uses y,(1) and y;(1) as initial values and, by following 
similar procedure, у,(2) and у,(2) are obtained 


Given the equations 


dy, 
dx Tz tte 2100) =1 
dy; 
Wecltnxe x02 


Givenx;20, ую=1.  x*»--l 
т\(1) = fix Ys Yo) 50 1-120 
my(2) = fixo, yu Yoo) 21 1-171 
m1) = fis h, у + AMD, yag + hm4(2)) 
zf(0.1,140.1x 0, -1 + 0.1 x 1) 
(0.1, 1, — 0.9) 
=01+1-0.9=0.2 
m2) = [Дх + h, уд + hmi), уру + hmy(2)) 
= f4(0.1, 1,—0.9) 
=1+1-09=1.1 
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m, + mg Q) 


т(1) = 2 =0.1 
2) 
m(2) A mem, 105 


у1(0.1) = y,(0) + Am(1) = 1 (0.1) (0.1) = 1.01 
¥2(0.1) = у,(0) + А т(2) = –1 + (0.1) (1.05) = -0.895 


а HIGHER-ORDER EQUATIONS 


We have seen in the introductory section of this chapter that many 
problems involve the solution of higher-order differential equations. A 
higher-order differential equation is in the form 


d™y ( ау diy ату 
d"x x’ dx? axed 


Y 
] (13.51) 


with m initial conditions given as 
Ух) sa, yx) as vy" хо) = Oy 


We can replace Eq. (13.61) by a system of first-order equations as follows: 
Let us denote 


d d? dm- 
у=, ED a “ies Sr 
Then, 

Wia (x) у= 

dr 7X уух) = ую = а, 
Чу 
2s Ухо) = Yoo = az 

(13.52) 

dy, 
=r Ут - (50) =Ym-1,0= 4s i 
bn 

"аз =Й, Yon Ун) Ymo) = Ymo = Gm 


This system is similar to the system of first-order Eq. (13.50) with the 
conditions 

fiJo  isL2.,m-1 

Tin = Ро, уь Yas 5 Ум) 
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and, therefore, can be solyed using the procedure discussed in the previ- 
ous section. 


Sol» *e following equation for (0.2) 


given y 


Let 


Then, 


Yoo = 1 


Let h = 0.2 
my (1) = уд = 1 
т\(2) = бху + Зу — 2y4 2 - 2 
ту(1) = yay + йт\(2) = 1 + (0.2) — 
m5(2) = 6(0.2) + 3(0 + 0.2(1)) ~ 2(0.6) 


0.6 
12+ 0.6 - 1.2 = 0.6 


1+0.6 
(1)=——=0. 
m 2 0.8 


-240.6 
2 


у100.2) = у\(0) + 0.2m(1) = 0 + 0.2 х 0.8 = 0.16 
y (0.2) = у;(0) + 0.2m(2) = 1 — (0.2)(0.7) = 0.86 
yx) at x = 0.2 = 0.16 
у(х) at x = 0.2 = 0.86 


EC SUMMARY 


We encounter differential equations in many different forms while at- 
tempting to solve real life problems in science and engineering. The most 


m(2) = =-0.7 
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common form of differential equations is known as ordinary differential 
equation. In this chapter, we considered various methods of numerical 
solution of ordinary differential equations. ‘They include 

* Taylor series method 

* Eulers method 

* Heun's method 

Polygon method 

* Runge-Kutta method 

* Milne-Simpson method 

e Adams-Bashforth-Moulton method 

We also discussed the accuracy (and techniques of improving the ac- 
curacy) of these methods. Finally, we discussed the solution of a system 
of differential equations as well as higher-order equations. 

We presented FORTRAN programs and their test results for the fol- 
lowing methods: 

* Euler’s method 

* Heun's method 

* Polygon method 

* Runge-Kutta method 

* Milne-Simpson method 


Key Terms 

Adams-Bashforth predictor ^ Multistep methods 
Adams-Moulton corrector One-step methods 
Boundary-value problem Ordinary differential equations 
Corrector Partial differential equation 
Differentia! equations Picard's method 
Euler's method Polygon method 
Global truncation error Predictor 
Heun's method Predictor-corrector method 
Initial-value problem Radioactive decay 
Kirchhoffs law Runge-Kutta method 
Law of cooling Semi-numeric method 
Law of motion Simple harmonic motion 
Local truncation error Simpson's formula 
Midpoint method Starting values 
Milne's formula Taylor series method 

filno- 
Milne-Simpson method 


1, What is a differential equation? Give two real-life examples of ap- 
plication of differential equations. 
2. Distinguish between ordinary and partial differential equations. 
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. State the degree and order of the following differential equations: 


(a) ("2 «Ty 20 

(b) y" + 5y -1 

(с) у'"-у=0 

(aa ау? 0 

(е) , «3y-2y =x? 

O муў -2xy «y «0 

(g) xy’ - (x + Dy = 0 

th) yo + ay y - 2x7 = 0 

G) OF «y =0 

What is а nonlinear differential equation? State which of the equa- 
tions given in Question 3 are nonlinear. 

What js an initial-value problem? How is it different from a bound- 
ary-value problem? 


. Why do we need to use numerical computing techniques to solve 


differential equations? 


. State the basic two approaches used in estimating the solution of 


differential equations. How are they different? 


. Describe how Taylor's theorem of expansion can be used to solve a 


differential equa ion. 


. What are the limitations of Taylor's series method? 
. State the formula of Picard's method to solve the differential equa- 


tion of Lype 
dy X 
Че “PY? 


What are its limitations? 


- State the formula of Euler's method. Illustrate its concept graphi- 


cally. 


. Comment on the accuracy of Euler's method. 


Illustrate Heun's method of solution graphically. 


. How does the accuracy of Heun's method compare with that of 


Euler's method? 


. Heun's method is an improved version of Euler's method. Com- 


ment. 


. Why is Heun's method classified as one-step predictor-corrector 


method? 


. Why is the polygon method called the midpoint method? Illustrate 


graphically. 


. Describe the basic concept employed in Runge-Kutta methods. 
. What is meant by an r-order Runge-Kutta method? What is the 


order of the following methods? 
(a) Euler's method 
(b) Heun's method 
(c) Polygon method 
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slistep ш 
merits and demerits of mullistep meth Js? 

formulae used in Milne-Simpson method. De 

ion scheme of these formulae, 

5 ‘sor and corrector formules used in Ada 


ti uracy of multi- 


step mezho. 
25. A high- order differential equation can be solved by replacing it by a 
system of first-order equations. Discuss 


1. Use Taylor's expansion (with terms up to x’) to solve t^ 7 owing 
differential 


=х+у+ху, (0) = 
pp C Y vy, РА 


уф? - 1), 


(e 5 =х+у, y( = 1 


Zo y, y0)=1 
x: for x = 0.25 and 0.5 


(e oy =x, ж) =0 
for x = 20 and 3.0 
ay)" 9 
(5 E ху. yO)=Lend 90) =1 
for x = 0.2 wai дА 
92у (аху? n 
@) 102 (4) +8х=0, 0) =1 and x(0)= 


fort = 2 and 5. 
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2. Solve the following equations by Picards method and estimate y at 


x = 0.25 and 0.5: 
(a) Partas, (0) = 1 
(b) e = у-у, 0) =1 
e @Узх+у, x0)21 
(d) ay avy; yi) =0 
(e) Z-z, 200) =1 


3. Use the simple Euler's method to solve the following equations for 
y(1) using В-= 0.5, 0.25 and 0.1 

(0) =1 

у00) =2 


Г 


У(0) = 1 


У(0) = 1 


(e) yi sF y +y, x0)=1 
_A-Solve the differential equation 


ay =х+у, (0) =1 
by the simple Euler's method to estimate у(1) using A = 0.5 and 
h = 0.25. Compute errors in both Lhe cases. How do they compare? 
Also, compare your results with the exact answer given the analyti- 
cal solution as 

уф) = 2е'-х-1 

5. Use Heun’s method with A = 0.5 and h = 0.25 to solve the equations 
in Exercise 3 for y(1). How do the results compare with these ob- 
tained using simple Euler's method. 

6. Repeat Exercise 4 using the Polygon method instead of simple 
Euler’s method. Analyse the results critically. 

7. Use the polygon method to solve some of the equation in Exercise 3. 

8. Use the classical RK method to estimate y(0.5) of the following 
equations with А = 0.25. 


à 
а) arty, 30-1 
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10. 


12 


13. 


14. 


dy x = 
(b) ae F yO) =1 
4 
(©) e =у coss, (0) = 1 
(4) D ays sing, (0) = 2 
dy [y » 
(0) с ту + Ҹу, 00) =1 
. Solve the following initial value problems for x = 1 using the fourth- 
order Milne's method. 
а) ЧУ ay ui 0) = 1 


Use a step size of 0.25 and fourth-order Runge-Kutta method to 
predict the starting values. 

Repeat Exercise 9 using Adam's method instead of Milne's method, 
Compare the results. 


. Repent Exercise 10 and 11 with the application of modifier to the 


corrector. Compare the results. 


Solve the pair of simultaneous equations 
dy, 
E n у\0) = 0 


dy, 
Fe user 00) = 0 


to estimate y,(0.2) and y,(0.2) using any method of your choice. 


Solve the following equation for (0.2): 
d?y [dyY 4 
ir (a) *6:-0, y0)=1, у/(0) =0 


Use Heun's method. 
The general equation relating to current i, voltage V, resistance R, 
and inductance L of a serial electric circuit is given by 


A oss 
La +iR=V 
Find the value of current after 2 seconds, if resistance R = 20 ohms, 


inductance L = 50 Н and voltage V = 240 volts. Current J = 0 when 
t=0. 
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15. A tank contains a solution which is made dissolving 50 kg of salt in 


16. 


17. 


100 gallons of water. A more concentrated solution of З kg of salt 
per gallon of water is pumped into the tank at a rate of 4 gallons 
per minute. The solution (which is stirred continuously to keep it 
uniform) is pumped out at a rate of 3 gallons per minute. 

Find the amount of salt in the tank at ¢ = 15 minutes. Note that 
initially at t = 0, the amount of salt x = 50 kg. 


Hint: If x represents the amount of salt in the tank at time t, E: 


will represent change in the amount of salt. 
An object with a ma. of 10 kg is falling under the influence of 
earth gravity. Find its velocity after 5 seconds if it starts from the 
rest. The object experiences a retarding force equal to 0.25 of its 
velocity. 
Hint: The relationship between the various forces is given by 


dv 
mass x =~ = mass x gravity - retarding force 


dt 
where v is velocity at timo t. 
A body of mass 2 kg is attached to a spring with a spring constant 
of 10. The differential equation governing the displacement of the 
body y and time ¢ is given by 


d*y dy 

an t? ar 
Find the displacement у at time t = 1.5 given that y(0) = 2 and 
y0) = -4. 


+5y=0 


. Rewrite the program RUNGE4 such that a subprogram implements 


the fourth order Runge-Kutta method and a main program receives 
input information, drives the subprogram to compute the solution, 
and prints the required output information. 


. Modify the program MILSIM to incorporate the following changes: 


(a) Computing the starting points by Runge-Kutta method using a 
subprogram 

(b) Implementing Milne-Simpson algorithm by a subprogram 

(c) Applying the modifier to the corrector with the help of a sub- 
program. 


. Develop a user-friendly modular program as suggested in Project 2 


for the fourth-order Adams method with modifiers. 


. Develop a user-friendly program for solving systems of differential 


equations using Euler's method or Heun's method. 
Repeat Project 4, but use the fourth-order Runge-Kutta method 


Boundary Value and 
Eigenvalue Problems 


Eram NEED AND SCOPE 


We have seen that we require m conditions to be specified in order to 
solve an m-order differential equation. In the previous chapter, all the m 
conditions were specified at one point, x = хо, and, therefore, we call this 
problem as an initial-value problem. It is not always necessary to specify 
the conditions at one point of the independent variable, ‘They can be 
specified at different points in the interval (a, b) and, therefore, such 
problems are called the boundary value problems. A large number of 
problems fall into this category. 

In solving initial value problems, we move in steps from the given 
initial value of x to the point where the solution is required. In case of 
boundary value problems, we seek solutions at specified points within 
the domain of given boundaries, for instance, given 


2 
g =fayy)  ya)=ya, yb) =y, (14.1) 


dx 


we are interested in finding the values of y in the range a < x < b. 

There are two popular methods available for solving the boundary 
value problems. The first one is known as the shooting method. This 
method makes use of the techniques of solving initial value problems. 
The second one is called the finite difference method which makes use of 
the finite difference equivalents of derivatives. 

Some boundary value problems, such as study of vibrating systems, 
Structure analysis, and electric circuit system analysis, reduce to a system 
of equations of the form. 


Ах = х (14.2) 
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Such problems are called the eigenvalue problems. We need to determine 
the values of 4 and vector x which satisfy the Eq. (14.2). We have two 
simple methods available to solve this type of problems. 

1. Polynomial method 

2. Power method 
In this chapter, we consider these two special categories of problems and 
discuss the following methods to solve them: 

1. Shooting method 

2. Finite difference method 

3. Polynomial method 

4, Power method 


Bz SHOOTING METHOD 


This method is called the shooting method because it resembles an 
artillery problem. In this method, the given boundary value problem is 
first converted into an equivalent initial value problem and then solved 
using any of the methods discussed in the previous chapter. The approach 
is simple. Consider the equation 

Y'-fG, yy? ya)=A, у) = В 
By letting y’ = 2, we obtain the following Бе! of two equations: 

у =2 

z' = f(x,y, 2) 
In order to solve this set as an initial value problem, we need two 
conditions at x = a. We have one condition yu) = A and, therefore, 
require another condition for z at x = a. Let us assume that 2(a) = M; 
where M, is а “guess”. Note that M; represents the slope y(x) at x = a. 


Thus, the problem is reduced to a system two first-order equations with 
the initial conditions 


y =2 Уа) = А 
z'-f(x yz) 2(а) = М, (=y (a) (14.3) 


Equation (14.3) сап be solved for y and = using any one-step method 
using steps of ^, until the solution at x = b is reached. Let the estimated 
value of y(x) at x = b be B,. If B, = В, then we have obtained the required 
solution. In practice, it is very unlikely that our initial guess z(a) = Mis 
correct. 

If B, + B, then we obtain the solution with another guess, say 2(a) = 
M. Let the new estimate of у(х) at x = b be By (see Fig. 14.1). If B; is 
not equal to B, then the process may be continued until we obtain the 
correct estimate of y(b). However, the procedure can be accelerated by 
using an improved guess for z(a) after the estimates of B, and B, are 
obtained. 
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уо) 

i 

Direction of =н 

shooting (уа) = М) — iB 

— B, 

" ңа) = M) ! 
i, eae dest 

w) (x) 


Fig. 14.1 lustration of shooting method 


Let us assume that z(a) = M; leads to the value y(b) = B. If we assume 
that the values of M and B are linearly related, then 


Then 
B-B; 
My =M, + a x (M, - M) 
Bs 
=M,- 22-7 x(M, (14.4) 
h- a. x (M;- 


Now with z(a) = Му, we can again obtain the solution of у(х). 


PU  — —  àA^—^ 


Using shooting method, solve the equation 
d? 


БЕА =бх, У1)=2, y2-9 


=z у1)=2 


Let us assume that z(1) = y'(1) = 2(M,). Applying Hei n's method, we 
obtain the solution as follows: 
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Iteration 1 
h=05 
xsl, 30-32, el) ay =2 
m,(1) =29=2 
m,(2) = 6x, = 6 
т) = zy + hm (2) = 2 + 0.56) = 5 
m2) = 653 + A) = 6(1.5) = 
т1(0+ту() 2+5 _ 


mao) = 


2 БЕ: 
т.102) + т2(2) 649 
УА eB 

(2) 2 2 


убх) = у(1.5) = (D + MOA = 2 + 3.5 x 0.5 = 3.75 
2(x,) = 2(1.5) = 2(1) + m(25 = 2 + 7.5 x 0.5 = 5.75 
Iteration 2 


m,(2) = pa =9 
mj) =z; + hmí2) = 5.75 + 0.5 x 9 = 10.25 
m2) = бху + h) = 

5.75 + 10.25 


m) mE: m. 


m(2) it? = 10.5 


y(xg) = (2) = у(1) + m(1)h = 3.75 + 8x 0.5 = 7.75 
This gives В, = 7.75 which is less than B = 9 
Now, let us assume 2(1) = y '(1) = 4(М„) and again estimate у(2). 
Iteration 1 
hz05 
DI 
т) = 29 = 4 
m,(2) = 6: = 6 
m1) = г + hm,(2) 24 05x62 7 
т) = 6(xy + h) = 6(1.5) = 9 


4+7 
1) =——— = 5.5 
т(1) 2 


2974 
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6+9 
m(2) mo add 75 
ya) = y(1.5) = 2 + 5.5 x 052475 
201) = 21.5) = 447.5% 05=7 
Iteration 2 
h=05 
215, ур=4Л5, 2, = 7.75 
т) = 2, = 7.75 
m,(2) = 6x, =9 
m1) = 2, + hm,(2) = 7/75 + 0.5 x 9 = 12.25 
m4(2) = 6(x, + №) = 12 
_1765+12.25_ 


m(1) 3 = 


9+12 
m(2) = рә 10.5 


This gives В, = 9.75 which is greater than В = 9. 
Now, let us have the third estimate of z(1) = M; using the relationship 
(14.4) 


= 4 – 0.75 = 3.25 
The new estimate for z(1) = y*(1) = 3.25 
Iteration 1 
h=05 


%=1, w=% 22335 
тү1)= z, = 3.25 
тд) = 6x, = 6 
m1) = zg + hm4(2) = 3.25 + 0.5 x 6 = 6.25 
m2) = (xy +h) = 9 
3.25 + 6.25 
BE: 


т = =4.75 


9 
m(2) 279.25 
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1.5) = 2 + 425 x 0.5 = 4.375 


2(1.5) = 3.25 +75 x =7 
Iteration 2 
һ=05 
3-215, у= 375, 2,27 
т(1)=2 = 7 


my(2) = 6x, 26x 1529 
т) = 2, + hm(2) 27 + 0.5 x9 = 115 
ту(2) = 602 +h) = 12 


m(1) 2231128 _ 9 95 
m(2) = Hm -10.5 


у(2) = 4.375 + 9.25 x 05 = 9 


The solution is y(1) = 2, y(1.5) = 4.375, у(2) = 9 The exact solution is 
Уб) = x? + 1 and therefore y(1.5) = 4.378. 


The sequence of procedures for implementir the shooting method is 
given in Algorithm 14.1. 


1, Convert the problem into an initial-value problem. 

2. Initialise the variables including two guesses al the initial slope. 

3. Solve the equations with these guesses using either a one-step or a 
multistep method 

4. Interpolate from these results to find an improved value of the slopes 
obtained, 

5. Repeat the process until a specified accuracy in the final function 

value is obtained (or until а limit to the number of iterations is reached). 


Ea] FINITE DIFFERENCE METHOD 


In this method, the derivatives are replaced by their finite difference 
equivalents, thus converting the differential equation into a system of 
algebraic equations. For example, we can use the following “central 
difference” approximations: 
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Уа-а 


14.5) 
2h (14.5) 


Xi 2Yi Xia 
FEE NS 
These are second-order equations and the accuracy of estimates can be 
improved by using higher-order equations. 

The given interval (a, b) is divided into n subintervals, cach of width 
h. Then. 


(14.6) 


x o2x9*ih-a*ih 
у, = убх) = уба + th) 
Yo = ya) 

y, = (a + nh) = yib) 


This is illustrated in Fig. 14.2. The difference equation is written for 
each of the internal points i = 1, 2, ..., n — 1. If the DE is linear, this will 
result with (n — 1) unknowns уз, уу, ...› Yn 1. We can solve for these 
unknowns using any of the elimination methods. 


nx 


— + - 


x 
Fig. 14.2. Solution of DE by finite difference method 
Note that smaller the size of h, more the subintervals and, therefore, 


more are the equations to be solved. However, a smaller л yields better 
estimates. 


Given the equation 


d?y 


=e" with 50-20, 51-0 


dx? 
estimate the values of у(х) at x = 0.25, 0.5 and 0.75 


We know that 
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Yo =y(0) = 0 
y = 0.25) 
у» = (0.5) 
уз = 0.75) 
PEFTEEI 
h =0.25 


zt 
і=1х= 025 


УГ = 


= 3, х = 0.75 


y= 


aJi +a _ 


_ Уа 2У1 * Yo 


_2з3—32уз2 +У1 


Уа 72ys * Y2 


2 
ех 
A? 


(0.28)? = 1.0645 


0.0625 
Уз — 2y + Yo = 0.0665 


a) 


«айй ө. 
0.0625 е“ 1.2840 


Ya- 293 + у = 0.0803 


(2) 


=e (0,76)? = 
0.0625 е'0Л8)1 = 1.7551 


Y. +y = 0.1097 


(3) 


Letting yp = 0 and y, = 0, we have the fol wing system of three equations. 
-2y, + yy = 0.0665 
у — ®у; + уз = 0.0803 


уз – 2з = 0.1097 


Solution of these equations results in 
У, = 00.25) = — 0.1175 


уз = y(0.75) = – 0.1391 


The major steps of finite difference method are given in Algorithm 14.2. 
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Finite Difference Method 


1. Divide the given interval into n subinterval- 

2. At each point of x, obtain difference equation using a suitable 
difference formula. This will result in (n — 1) equations with (n — 1) 
UNKNOWNS, уз, Yaron y, 1. 

3. Solve for y, i= 1, 2, .., n — 1 using any of the standard elimination 
methods. 


Algorithm 14.2 


СИҢ sovne EIGENVALUE PROBLEMS 


As mentioned earlier, some boundary value problems, when simplified, 
may result in a set of homogencous equation of the type 
бац – AY, ax, +... + ах, =0 
ацху + (yy — A) + ap x, = 0 (14.7) 
Gy Xy + Upata +... + (а, Ах, = 0 
where À is a scalar constant. Equation (14.7) may be expressod as 
IA - AJ [X] - 0 (14.8) 
where I is the identity matrix and [A — AI] is called the characteristic 
matrix of the coefficient matrix A. 

The homogeneous Eq. (14.8) will have a non-trivial solution if, and 
only if, the characteristic matrix is singular. That is, the matrix |A — A I| 
is not invertible. Then, we have 

|A-al] (14.9) 


Expansion of the determinant will result in a polynomial of degree л in 


a -р,А"-1р,А"-? —P,-1A-p, =0 (14.10) 
Equation (14.10) will have л roots 2, 2; Àn. The equation is known 
as the characteristic polynomial (or characteristic equation) and the roots 
are known as the eigenvalues or characteristic values of the matrix А. 
The solution vectors Ху, Xy, ..., X, corresponding to the eigenvalues Ay 
^, А, are called the eigenvectors. 
The roots representing the eigenvalues may be real distinct, real re- 
peated, or complex, depending on the nature of the coefficient matrix A. 
The coefficients p, of the characteristic polynomial are functions of the 
matrix elements a, and must be determined before the polynomial can 
be used. | 
Example 14.3 illustrates the procedure vi valuation of eigenvalues 
and eigenvectors of а simple system 
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Ea 
Find the eigenvectors of the following system: 
8x, 4x, = Алу 


2x, + 2x, = Ах, 


The characteristic equation of the given system is 


That is 
(8- A(2-4)4+8=0 
or 
i*-1014820 


The roots are 


For А = 6, we get 
2y, — 4x, - 0 
2: - dx, =0 
Therefore x, = 2x, and the corresponding eigenvector is 


x-fi] 


Similarly, for 4 = Аз = 4, we get x, = x, and the eigenvector is 


The process of finding the eigenvalues and eigenvectors of large matri- 
ces is complex and involves a multistep procedure. There are several 
methods available and a discussion on all these methods will be beyond 
the scope of this book. We consider, in the next two sections, the follow- 
ing two methods: 

1. Polynomial method 

2. Power method 


ET POLYNOMIAL METHOD 


The polynomial method consists of the following three steps: 
1. Determine the coefficients р; of the characteristic polynomial using 
the Fadeev-Leverrier method 
2. Evaluate the roots (eigenvalues) of the characteristic polynomial 
using any of the root-finding techniques 
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3. Calculate the eigenvectors using any of the reduction techniques 
such as Gauss elimination 


The Fadeev-Leverrier Method 


The Fadeev-Leverrier method evaluate the coefficients p; i = 1, 2, ..., n, 
of the characteristic polynomial 


A" pA" pA" i - ... ~p, = 0 
The mothod consists of generating a sequence of matrices A, that can be 
employed to determine the p, values. The process is as follows: 
A,=A' 4.11) 
py tA; 


Remaining values (i = 2, 3, ..., n) are evaluated from the recursive 
equations: 


A, = ACA, _, 7 p, D 


tA, 
ГА prie (14,12) 


where t, А, is the trace of the matrix А, Remember, the trace is the sum 
of the diagonal elements of the matrix. 


Ш ___  __ 


Determine the coefficients of the characteristic polynomial of the system 
(-1- Ax, =0 
a, + (C2—A)xy +32, =0 
2x,* C3- AI - 0 
using the Fadeev-Leverrier method. 


48 - pA -pad - p,2 0 
The matrix A is given by 


0 о 
2 3 


By using the Еа. (14.11), 

A =A 

Dy Ау = -6 
By using the Eq. (14.12), 

A, = AA, — p.) 
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10 0 
12 3 


[= 0 
| 
0 2 -3j| 


0 
1 -2 3 
io 2 -3 


-1 0 015 0 0] 
ER 


=|1 -2 3 
о 2 -3]02 3 
-5 0 0 
=|-3 -2 3 
2 X 4 


Similarly, 


Therefore, the characteristic polynomial is 
ДЗ +642 + 52=0 
ог 


AA? + 6А + 5) 
[ы pre 4A... ————— 


Evaluating the Eigenvalues 
Let us consider the characteristic polynomial obtained in Example 14.4. 
AQ? +6145) =0 


One of the roots is А, = 0. The other two roots can be obtained using the 
familiar quadratic formula as 
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г АЕР 
Аз=-5 
Since it is a quadratic equation, we can solve it using the quadratic 
formula. However, if the polynomial is of hue. order, the roots may be 
evaluated using the techniques discussed in Chapter 6. We may use 
either the Newton-Raphson method with synthetic division or the 
Bairstow method. 
Remember that the sum of the eigenvalues of a matrix is equal to the 
trace of that matrix. In the problem discussed above, 
trace of A = -1-2-3-—6 
Sum of eigenvalues = 0- 1-5 =-6 


Determining the Eigenvectors 


Once eigenvalues are evaluated, eigenvectors corresponding to these 
eigenvalues may be obtained by applying Gauss elimination method to 
the homogeneous equations. Example 14.5 illustrates this. 


Ààz0 A--L A= 


Eigenvector 1 (A, = 0) 
The system of equations for A, = 0 is 


—20 
ху- 2х. + Зху= 0 
2x,-83,20 

‘This is equivalent to the system of two equations 
x,=0 
2x, — 3x5 =0 
Choosing n=l, 


ху = 2/3 = 0.6667 
Kigenvector 2 (À, = —1) 
Xi — Xo + 3x,=0 
2x2- 2x3= 0 
Choosing хз = 1, weget rą=1 and x,--2 
Eigenvector З (Aq = — 5) 
4x,=0 
xı + 3x2 +323=0 
2x,42x,-0 
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хү= 0 and choosing x,=1, weget x,--1 
The three eigenvectors are: 


Computing Algorithm 


The computing algorithm for polynomial method combines three differ- 
ent algorithms discussed so far. Algorithm 14.3 lists the major steps 
involved in implementing the polynomial method of finding eigenvalues 
and associated eigenvectors. 


Polynomial Method 


Input order of the matrix and the elements of the matrix. 

2. Determine the coefficients of the characteristic polynomial by using 
the Fadoov-Leverrier method. 

3, Evaluate the roots of the polynomial using the Bairstow method (use 
Algorithm 6.10). 

4. For each eigenvalue, construct the system of equations and then 
solve for the eigenvector using the Gauss elimination method (use 
Algorithm 7.2). 

5. Print eigenvalues and the associated eigenvectors. 


| Algorithm 143] 


HE POWER METHOD 


Power method is a ‘single value’ method used for determining the ‘domi- 
nant’ eigenvalue of a matrix. It is an iterative method implemented 
using an initial starting vector X. The starting vector can be arbitrary if 
no suitable approximation is available. Power method is implemented as 


follows: 
C AX 5 (14.13) 
Co = i (14.14) 
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The new value of X obtained from Eq. (14.14) is used in Eq. (14.13) to 
compute a new value of Y and the process is repeated until the desired 
level of accuracy is obtained. The parameter k, known as the scaling 
factor, is the element of Y with the largest magr';ude. 

Let us assume that the eigenvalues are |4| > |A| >... 2|A,| and 
the corresponding eigenvectors are X,, X}, ..., X,- After repeated applica 
tions of Eqs (14.13) and (14.14), the vector X converges to X, and Ё 
converges to À}. 


шт [ i 


Find the largest eigenvalue д; and the corresponding eigenvector V, of 
the matrix 


| 
x-|1| 
0] 
Equations (14.13) and (14,14) аге repeatedly used as follows 


Keration 1 


ИҢ 


io o -1}{0} [o] 

112] 1 | 
X==/1/=|0.5; 

RH 


Iteration 2 
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The process is continued and the results are tabulated below: 


Iteration 1 2 3 4 5 6 7 
20 20 28 286 298 298 3.0 
Y 10 25 26 2.3 296 299 30 


00 00 00 00 оо 00 00 

0 10 08 10 098 10 10 10 

x 1 05 1.0 093 10 0.99 1.0 1.0 
0 00 00 00 00 00 00 00 


The final entry in the table shows that A, = 3.0 (element of Y with 
largest magnitude) and the corresponding eigenvector is the last X. That 


15, 
1 
x,|i 
0 


Algorithm 14.4 gives an implementation of the power method. Note that 
the stopping criterion is based on the successive values of the vector X. 
There might be circumstances where the process will not converge at all 
(or where it will converge very slowly). Therefore, it is necessary to put а 
limitation on the number of iterations. 


1. Input matrix A, initial vector X, error tolerance (EPS) and maximum 
iterations permitted (MAXIT). 
Compute Y - AX 
. Find the element k of Y that is largest in magnitude. 
. Compute X = Үк 
. M [X Ха] < EPS or Iterations > MAXIT 
write k and X 
else 
go to step 2 


тлом 


| Algorithm 14.4 | 


Engineers often come across cases where they are interested in the 
smallest eigenvalue of the system. The smallest eigenvalue can be deter- 
mined by applying the power method to the matrix inverse of [A]. 


EE SUMMARY 


We considered two classes of problems in this chapter: 
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* boundary value problems 

* eigenvalue problems 
We presented two methods for solving boundary-value problems: 

* shooting method 

* finite difference method 

We also discussed in detail the nature and solution of eigenvalue 
problems and presented two methods for evaluating eigenvalues and 
eigenvectors: 

* power method 

* polynomial method 


| ке, Terms | 


Boundary value problem Finite difference method 
Characteristic equation Identity matrix 
Characteristic matrix Initial-value problem 
Characteristic polynomiat Polynomial method 
Characteristic values Power method 
Eigenvalue problem Scaling factor 
Eigenvalues Shooting method 
Eigenvector Trace of the matrix 


Fadeev-Leverrier method 


1, What is a boundary-value problom? How is it different from an 
initial-value problem? 
2. State the two popular methods used for solving boundary-value 
problems. 
3. What are eigenvalue problems? How are they different from bound- 
ary-value problems? 
4. State at least two methods used for solving eigenvalue problems. 
5. Describe the shooting method with graphical illustration. 
6. Explain the concept employed in the finite difference method 
7. Define the following: 
(a) Identity matrix 
(b) Characteristic matrix 
(c) Characteristic polynomial or equation 
(d) Eigenvalues 
(е) Eigenvectors 
(0 Trace of a matrix 
. What is Fadeev-Leverrier method used for? Explain. 
- Describe the algorithm of polynomial method used for solving eigen- 
value problems. 
10. Describe the implementation of power method with the help of a 
flow chart 


т 
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1. Solve the following equations using the shooting method. 


азу 

(а) 2 = 6+4, у00)=2 and (1) =5 
© 
2 

() SX i29, yay=2 ad y(2)=17 
d 


2. Use the finite difference approach to solve the equations in Exer- 
cise 1 with Ax = 0.2. 

3. Use the shooting method to solve the following differential equa- 
tion: 


given the boundary conditions у(0) = 10 and y(10) = 6. 
4, Solve Exercise 3 using the finite difference method with Ax = 2. 
5. Given the boundary-value problem 


Qf) арду y0-21 and 0-1 
dx? 


obtain its solution in the range 0 < x € 1 with Ax = 0.25 using 
(a) shooting method 
(b) finite difference method 
6. Given the equation 
2 
x E cas ayeas sinx=0 
and boundary conditions y(1) = 1 and y(2) = 2, estimate (1.25), 
y(1,5) and y(1.75) using the shooting method. 
7. Solve the following boundary-value problems using a suitable meth- 
od. 


(а) бу? = 0, У0)=1 and (2) = 0.25 
агу 

(b) Le = 0, У00=0 and (2) =1 

(Q 953.39 ор goi and 0-4 
uu 5050 Os › 
dar- dy " 

@ Ga tg ya 300--2 ad 0-1 


8. Find the characteristic polynomials of the following systems using 
Fadeev-Leverrier method. 


(а) 2 inan 
(a) 2x, + 8x) 4 10xg se Ax, 
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8x, + 3x, +42, 2x, 
10x, + 4x, + а= Аха 
(b) 16x, - 24x; + 18x, =Ax, 
3x; - 2x, X2 
-9x, + 18x3- l7, = Ax; 
(c) 2x, + 24 + 2x57 Ax, 
2x, + 5x) + 5ху= Ах 
2x, + 5» + lay Азу 
9. Evaluate the eigenvectors of the systems given in Exercise 8. 
10. Find the largest eigenvalue and the corresponding eigenvector of 
the following matrices using the power method. 


[2: A 0] 
(a) A=j-1 2 -1 
0-12 


Develop a program to implement the shooting method algorithm 
for a linear, second-order ordinary differential equation. 
2. Develop a program to implement the finite difference method of 
solving a linear, second-order ordinary differential equation. 
3. Develop а modular program which uses the following subprograms 
to implement the polynomial method 
(a) Subprogam to obtain the coefficients of the characteristic 
polynomial using Fadeev-Leverrier method. 
(b) Subprogram to evaluate the roots of the characteristic poly- 
nomial (use Bairstow method). 
Subprogram to determine the eigenvectors (use Gauss elimi- 
nation method). 
You may also use subprograms to receive input information and 
print output information. 
4. Develop a user-friendly program to evaluate the largest eigenvalue 
and the corresponding eigenvector using the power method. 
5. Develop a program to compute the smallest eigenvalue using the 
power method. 


(с 


Solution of Partial 
Differential Equations 


т NEED AND SCOPE 


Many physical phenomena in applied science and engineering when 

formulated into mathematical models fall into a category of systems 

known as рага! differential equations. A partial differential equation is 

a differential equation involving more than one independent variable. 

‘These variables determine the behaviour ol the dependent variable as 

described by their partial derivatives contained in the equation. Some of 

the problems which lend themselves to partial differential equations 

include: 

. Study of displacement of a vibrating string, 

. Heat flow problems, 

Fluid flow analysis, " 
. Flectrical potential distribution, 
Analysis of torsion in a bar subject to twisting, 

. Study of diffusion of matter, and so on. 

Most of these problems can be formulated as second-order partial 

differential equations (with the highest order of derivative being the 

second). If we represent the dependent variable as f and the two 

independent variables as x and y, then we will have three possible second- 

af af ә” 

ox?” дхду 

first-order partial derivatives "^. and L. 
ox ду 


оо шм 


order partial derivatives in addition to the two 


We can write a second-order equation invoiving two independent vari- 
ables in general form as 
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af R d of of 

* p 

2х1 °* aay dy? Ox’ ду 
where the coefficients a, В, and c may be constants or functions of x and 
у. Depending on the values of these coefficients, Eq. (15.1) may be classi- 
fied into one of the three types of equations, namely, elliptic, parabolic, 
and Ayperbolic 


(5.1) 


Fs». 


Elliptie, ifb? — 4ac < 0 

Parabolic, ifb? — 4ac = 0 

Hyperbolic, ifb? —4ac > 0 
If a, b, and c are functions of x and y, then depending on the values of 
these coefficients at various points in the domain under consideration, 
an equation may change from one classification to another. 

Solution of partial differential equations is too important. to ignore 
but too difficult to cover in depth in an introductory book. Since the 
application of analytical methods becomes more complex, we seek the 
help of numerical techniques to solve partial differential equations, 
There are basically two numerical techniques, namely, finite-difference 
method and finite-element method that can be used to solve partial 
differential equations (PDEs). Although the finite-element. method is 
very importaat for solving equations where regions are irregular, 
discussion on this technique is beyond the scope of this book, We will 
discuss here the application of finite-difference methods only, which are 
based on formulae for approximating the first and second derivatives of 
a function. We will also consider problems, only those where the 
coefficients a, 6, and ¢ are constants. 


BA DERIVING DIFFERENCE EQUATIONS 


In this section, we will discuss two-dimensional problems only. Consider 
a two-dimensional solution domain as shown in Fig. 15.1. The domain is 
split into regular rectangular grids of width А and height k. The pivotal 
values at the points of intersection (known as grid points or nodes) are 
denoted by fu which is a function of the two space variables x and y. 


Mayth 
Улту +ћ 


Xar Xo Gey Xea 


Fig. 15.1 Two-dimensional finite difference grid 
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In the finite difference method, we replace derivatives that occur in 
the PDE by their finite difference equivalents. We then write the 
difference equation corresponding ‘ each “grid point” (where derivative 
is required) using function values at the surrounding grid points. Solving 
these equations simultaneously gives values for the function at each 
grid point. 

We have a. ssed in Chapter 11 that, if the function f(x) has a 
continuous four.a derivative, then its first and second derivatives are 
given by the following central difference approximations, 


fx +h)- f(x - h) 


fe) = ui 
or 
fisı- 
m 152 
fi Jk (15.2) 
m fei d h)-2f(x,)* fx; А) 
he 

or 

„afea Mf fea (15.3) 


i he 
‘The subscript on f indicates the x-value at which the function is evaluat- 
ed. 


When f is a function of two variables x and y, the partial derivatives of 
f with respect x (or y) are the ordinary derivatives of f with respect to 
x (or y) when y (or x) does not change. We can use Eqs (15.2) and (15.3) 
in the x-direction to determine derivatives with respect to x and in the 
y-direction to determine derivatives with respect to y. Thus, we have 


Sinon ^ Hb fee dam 
Hon) „д „аР 
шай he Iti eh 
ой efi d "Decii a Bd 


O?f(xsy) fis Ура) -fis Y) -fæi Yy a) +f AY- 


dx dy Shk 
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It is convenient to use double subscripts i, j on f to indicate x and y 
values. Then, the above equations become 


gc fena (15.4) 
б Ыза (15.5) 
al Ны, tious (15.6) 
fy ei ыбы Е: 6.7) 
fd o taa эб nthga (15.8) 


Ahk 


We will use these finite difference equivalents of the partial derivatives 
to construct various types of differential equations. 


EB ELLIPTIC EQUATIONS 


Elliptic equations are governed by conditions on the boundary of closed 
domain. We consider here the two most commonly encountered elliptic 
equations, namely, 

Laplace's equation, and Poisson's equation. 


Laplace's Equation 
Equation (15.1), when a = 1, b 20, c = 1, and Fix, у, f. fos fy) = 0, becomes 
ha ДЗЕ Lau 
[T ME шыда Ашы (15.9) 
The operator 
ð 2 
Via Sg a 
e "ay? ) 


is called the Laplacian operator and Ед. (15.9) is called Laplace’s equa- 
tion. (Many authors use u in place of f.) 

To solve the Laplace equation on а region in the xy-plane, we subdi- 
vide the region as shown in Fig. 15.1. Consider the portion of the region 
near (x,, y,). We have to approximate 


EN s de 
VARI p 
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Replacing the second-order derivatives by their finite difference equiva- 
lents (from Eqs (15.6) and (15.7)) at the point (х, y), we obtain, 
Бл? tfiv, fuia 2Ёу tha 

he ii k2 
If ме assume,.  implicity, Л = k, then we get 


Уа = 


Vio ү autas А+ ажа e 0510 


Note that Ед. (15.10) contains four neighbouring points around the central 
point (x,, y,) (on all the four sides) as shown in Fig. 15.2. Equation (15.10) 
is known as the five-point difference formula for Laplace's equation. 


1+1 


ГР) 
Fig. 15.2 Grid for Laplace's equation 


We can also represent the relationship of pivotal values pictorially as 
in Eq. (15.11). 


(15.11) 


From Eq. (15.10) we can show that the function value at the grid point 
(ху, yj) is the average of the values at the four adjoining points. That is, 


= Met fiot hase thaw | (1549) 


To evaluate numerically the solution of Laplace's equation at the grid 
points, we can apply Eq. (15.12) at the grid points where f, is required 
(or unknown), thus obtaining a system of linear equations in the pivotal 
values f. The system of linear equations may be solved using either 
direct methods or iterative methods. 
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ВЕР | i 


Consider a steel plate of size 15 cm x 15 cm. If two of the sides are held 
at 100°C and the other two sides are held at 0°C, what are the steady- 
state temperature. at interior points assuming а a grid size of 5 cm x Бет, 


A problem with the values known on | each b boundary i is said to have 
Dirichlet boundary conditions. The problem is illustrated below. 


The system of equations is as follows: 
At point 1: fa + fa- 4f + 100+ 100 =0 
At point 2: f,*f,-4f,«100« 0 =0 
At point 3: f, + fy- 4fa + 100 + 0 
Al point 4: /,*/5-4f,* 0+ о 
That ів, 


o 
0 


-Af + fa + fa + 0-200 
fi+-4fa+ 0+ f= -100 
fi 0- 4h +f = -100 
0+h+h-4h,= 0 
Solution of this system are 
h=75 = 50 
Б =50 fy = 25 
Note that there is a symmetry in the temperature distribution, i.e. it can 
be stated that 
А-А 


ана therefore the number of equations in Example 15.1 may be reduced 
to three with three unknowns as shown below. 


~4f; + 2f, = -200 
fí- Af. + f = -100 (15.13) 
fs - 2f, 
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liebmann's Iterative Method 


We know that a diagonally domine-t system of linear equations can be 
solved by iteration methods such г.. Gauss-Seidel method. When such an 
iteration is applied to Laplace’s equation, the iterative method is called 
Liebmann's iterative method. 

To obtain .. — ivotal values of f by Liebmann's iterative method, we 
solve for f; the . „uations obtained from Eq. (15.10). That is, 


| = Mss itf gei fua (15.14) 


The value /; at the point ij is the average of the values of f at the four 
adjoining points. If we know the “initial values” of the functions at the 
right-hand side of Eq. (15.13), we can estimate the value f at the point ij. 
We can substitute the values thus obtained into the right-hand side to 
achieve improved approximations. This process may continue till the 
values f; converge to constant values. 

Initial values may be obtained by either taking diagonal average or 
cross average of the adjoining four points. 


Гу + +100 
pipa 


100 
Ñ „йэй (15.15) 


Appropriate initial values for the iterative solution are obtained by tak- 
ing diagonal average at 1 and cross average at other points, assuming 
first f, = 0. 


f, = (000 + 100 + 100 + 0) = 75.00 (average) 


hs nus 100 + 0 + 0) = 43.75 
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f= {Чоо + 75 + 0 + 0) = 43.75 


f, = 344375 + 49.75 + 0 + 0) = 21.88 


Note that f2, fs, and f, are computed using the latest values on the right- 
hand side. 

Using these initial values in Eq. (15.15) and performing iterations 
gives the values as shown in Table 15.1. 


Table 15.1 


Iterations 
1 y 3 4 


71.88 74.22 74.81 74.95 
48.44 49.61 40.90 49.98 
48.44 49.61 49.90 49.98 
2422 2481 24.95 24.99 


The process may be continued till we get identical values in the last two 
columns. Note that the values are approaching to correct answers ob- 
tained in Example 15.1. 


Poisson's Equation 


Equation (16.1), when a = 1, b = 0, c = 1 and Fix, у, f, f, f) = gle, у), 
һесотев 


af gf p 
af yr ae (15.16) 
ог 
V*f - glx, y) 


Equation (15.16) is called Poísson's equation. Using the notation g, = 
g(x, у). Eq. 15.10 used for Laplace's equation may be modified to solve 
Eq. 15.16. The finite difference formula for solving Poisson's equation 
then takes the form 


Meus thevst garth EUN (1517) 
By applying the replacement formula to each grid point in the domain of 
consideration, we will get a system of linear equations in terms of f;. 


These equations may be solved either by any of the elimination methods 
or by any iteration techniques as done in solving Laplace's equation. 


Solve the Poisson equation 

vif-2xy 
over the square domain 0 <x <3 and 0 < y < 3 with f= 0 on the boundary 
апал = 1. 
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The comain is divided into squares of one unit size as illustrated below: 


0 x-:1  x-2 0 


By applying Eq. (15.17) at each grid point, we get the following set of 
equations: 


Ponti: 0+0+/5+%%-4/, = 212) 


ie — fo*h-4h =8 (a) 
Point2: 0+0+h +f- 4f, = 2227 
ie — fhíá-Af +, =32 (b) 
Point 3: 0404f *f,-4fs = 2070" 
ie — f-4f*f =2 (e) 
Point 4: 0+0+A+h—4h, = ADOP 
ie fath-4h=8 (4) 
Rearranging the equations (a) to (d), we get 
-4fithth = 8 


А-4 +h =32 
h -4ht+h= 2 


f*fs- Af, = 8 
Solving these equations by elimination method, we get the answers. 
„Ж | 
һе -22, = = 


h=ithth-® 
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1 4 
һ= т (б +һ-3% 
f= 10+ 2) 
3 4 1 4 


f= 145-8 


Note that f, = f, 
Therefore, 


һ= id fs m 
f= Rf, - 92) 


f= ion-2 


Assume starting values as f} = 0 = f, 
Iteration 1 

hs o f--M /=-1 
Iteration 2 


Tm T em 
fh rH, get 
Iteration 3 
22 43 
ће. hep hee 
Iteration 4 


Газ] PARABOLIC EQUATIONS 


Elliptic equations studied previously describe problems that are time- 
independent. Such problems are known as steady-state problems, But 
we come across problems that are not steady-state. This means that the 
function is dependent on both space and time. Parabolic equations, for 
which 


b-4ac = 0 
describe the problems that depend on space and time variables. 
A popular case for parabolic type of equation is the study of heat flow 
in one-dimensional direction in an insulated rod. Such problems are 
governed by both boundary and initial conditions. 
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Insulation 


to. 0) 


x=0 t Хе 


Fig. 15.3 Heat flow in a rod 


Let f represent the temperature at any point in rod (Fig. 15.3) whose 
distance from the left end is x. Heat is flowing from left to right under 
the influence of temperature gradient. The temperature f(x, t) in the rod 
at the position x and time /, is governed by the heat equation 


af 2f 
kı ac 5h (15.18) 
where А, = Coefficient of thermal conductivity; ё, = Specific heat; and 
hg = Density of the material. 

Equation (15.18) may be simplified as 


kf. ux, t) = fx, t) (15.19) 


where 


TEC 


kaka 


The initial condition will be the initial temperatures at all points along 
the rod. 


Го, 0)=fix),  0sxsL 


The boundary conditions ft0, t) and f(L, t) describe the temperature at 
each end of the rod as functions of time. If they are held constant, then 


fO,t=c,, 05245 
fü, 0) = с, O0tt«e 


Solution of Heat Equation 
We can solve the heat equation given by Eq. (15.19) using the finite 
difference formulae given below: 
(x,f T) - fíx,t 
Ane obra 

т 
=i 15.20) 
= Cah (15.20) 


fol, p = LER SD - 2F (x,t) + fet ht) 
ee оле 
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#т\/- ufi (15.21) 
Substituting (15.20) and (15.21) in (15.19), we obtain 
k Р r 
10, - = аа at fit) (15.22) 
Solving for f; «1 
271 tk 
ANE CES күй ы thes) 
ШЕ ОЕ ЖЕТУ. (15.28) 
where pi 
h? 


Bender-Schmidt Method 
The recurrence Eq, (15.23) allows us to evaluate f at each point x and at 


any time t. If we choose step sizes Aż and Ax such that 


1-2" = (15.24) 


Equation 15.23 simplifies to 


(15.26) 


Equation 15.25 is known as the Bender-Schmidt recurrence equation. 
This equation determines the value of f at x = x,, at time = t, + т, as the 
average of the values right and left of x, at time tj. 
Note that the step size in time At obtained from Eq. (15.24) 
«А* 
722% 
gives the Eq. (15.25). Equation (15.23) is stable, if and only if the step 
h2 


size t satisfies the condition г< a 


CS 08 


Solve the equation 
[09,0 =, D, O<t<s15 and б<х<4 
given the initial condition 
Ро, 0) = 5004 - х), 0<х<4 
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and the boundary conditions 
ГО, 2) = 0, ost<15 
f4,0=0, Osts15 


1 we assume Ax = h = 1, At = t must г 


Taking т = 0.25, м have 


hga ght he) 


Using this formula, we can generate successfully f(x, 2). The estimates 
are recorded in Table 15.2. At each interior point, the temperature at 
any single point is just average of the values at the adjacent points of 
the previous time value. 


Table 15.2 
h x 0.0 10 20 3.0 
TN fixo) 
0.00 об 1500 1000 500 Ре = буз -х) 


0.25 0.0 50.0 100.0 50.0 0.0 
0.50 0.0 50.0 50.0 60.0 0.0 


0.75 0.0 25.0 25.0 25.0 0.0 
1.00 0.0 12.5 25.0 12.5 0.0 
1.25 0.0 12.5 12.5 


6.25 12.5 


The Crank-Nicholson Method 


Solution of the parabolic equation given in Eq. (15.19) was solved using 
a forward difference formula in Eq. (15.20) for the time derivative and a 
central difference formula in Eq. (15.21) for the space derivative. This is 
called explicit method because all starting values are directly available 
from initial and boundary conditions and each new value is obtained 
from the values that are already known. 

‘Accuracy of the explicit method may be improved if we use central 
difference formulae for both time and space derivatives. The forward 
difference quotient used for the time derivative (Eq. (15.20)) 


fajsi- fii 
т 
may be treated as central difference if we consider it to represent the 
midpoint of the time interval (j, j + 1). We can also use the central 
difference quotient for the second derivative with distance, correspond- 
ing to the midpoint in time. Then, 
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fi{ze+3)-4 


The central difference quotient for the second-order space derivative is 
obtained by taking the average of difference quotients at the beginning 
and end of the time step, i.e. 


1-й) (15.26) 


Central difference at ғ 


tein = i (Central difference at t, + Central 


difference at t, , |). 


Then, 
Ыбы t а-ы Hiag “Шы. fimi n ux] 
(16.27) 
Then equating Eqs (15.26) and (15.27) and substituting 
eS. 
A? 


We get NE m 
EZ tyri +N a Hijet = rf ue t 


=f irf. 
j С (15.28) 
Equation (15.28) is called the Crank-Nicholson formula. If we let r =1, 
then Eq (15.28) simplifies to 

[бышуу ите и та | (18.29) 
The terms on the right-hand side are all known. Hence Eq. (15.29) forms 
а system of linear equations. The points used in the Crank-Nicholson 
formula are shown in Fig. 15.4. The boundary conditions are used in the 
first and last equations, i.e. 


ылып 

hu =fajsi =t 
The Crank-Nicholson formula is called an implicit method because the 
values to be computed are not just a function of values at the previous 
time step, but also involve the values at the same time step which are 
not readily available. This requires us to solve a set of simultaneous 
equations at each time step. 

Referring to Fig. 15.4, 
fa + Afa -fe = fo + fe 


ПАНА Aged Pede 


1-1.) Я] ety 
Fig. 15.4 The Crank-Nicholson grid 
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E c 


Solve the problem in Example 15.5 by the Crank-Nicholson implicit 
method 


Let us use a table as shown in Table 15.3 for recording the function 
values at various time steps. The values for the first time step (t = 0) are 
obtained from the initial condition 


f(x, 0) = 5004 – х) 
and the values for f, and f; are obtained from the boundary conditions. 
Table 15.3 

tj x=0 m$ х= х=4 
B h ho fs. 
0.0 150.00 100.00 50.00 0.0 
0.0 56.25 75.00 43.75 0.0 
0.0 0.0 
0.0 0.0 
0.0 0.0 


Now, for the second time step (1 = 0.25), we write equations at each point 
using Eq. (15.29) and solve for unknowns. Thus, for the second row in 
the table, we have 


-0.0+4f,-f,= 0.0 + 150 
-f+ 4fy—- f= 150 + 150 
—fy+4f,-0.0= 100 +0.0 
Solving these three equations for three unknowns, we obtain 


Р = 56.25 
fa = 15.00 
f= 43.75 


This process may be continued for each time step. Students may com- 
plete the table. 


IET HYPERBOLIC EQUATIONS 


Hyperbolie equations model the vibration of structures such as build- 
ings, beams and machines. We consider here the case of a vibrating 
string that is fixed at both the ends as shown in Fig. 15.5. 

The lateral displacement of string f varies with time ¢ and distance x 
along the string. The displacement f(x, t) is governed by Ше wave equation 
af af 
ox? "Раа 
where T is the tension in the string and р is the mass per unit length. 
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x=0 x xci f 


Fig. 15.5 Displacement of a vibrating string. 


Hyperbolic problems are also governed by both boundary and initial 
conditions, if time is one of the independent variables. Two boundary 
conditions for the vibrating string problem under consideration are 


f0,0-0 O<tsb 

f(L,t)=0 Ostsb 
Two initial conditions are 

f(x, 0) = Ах) Osxsa 

f(x, Ose) Osxsa 


Solution of Hyperbolic Equations 


The domain of interest, 0 < x < а and 0 < t < b, is partitioned as shown in 
Fig. 15.6. The rectangles of size Ax = Л and At = т 


b Ба 
А 
m I 
lo в At 
i 42 L 
| 
bes — E 
al. [| 
0 йл X ri * 


Fig. 15.6 Grid for solving hyperbolic equation 


The difference equations for f,,(x, t) and f,(x, t) are: 
(x-h;D- ht 
Lis ga echo 2л D fixe ht) 


ft - )-2f(x,0 fix tx) 
т? 


fide. t) = 
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This implies that, 


fion Эбы + л _ fuga faf 
T Ai sp E 


Solving this for /, ; . |, we obtain 


{. Tr?) Tr? 
fujsi 14-12 le ify tone eas +fi-1)) 


If we can make 


then, we have 


thew f (15.30) 


The value of f at x =x, and t = 4, + ris equal to the sum of the values of 
fat the point x = х, - A and x = x, * h at the time? = 4, (previous time) 
minus the value of f at x = x, at time t = t- t, From Fig. 15.6, we can say 
that, 


fa * fa + fo- fc 


Starting Values 


We need two rows of starting values, corresponding to j = 1 and j = 2 in 
order to compute the values at the third row. First row is obtained using 
the condition 

fx, 0) = fix) 
The second row can be obtained using the second initial condition ав 
follows: 

fix, 0) = glx) 
We know that 
i 


fosi- fio-i 
fix, 0) = — сш; 


fa = fia- 2 та, fort = 0 only 
Substituting this in Eq. (15.30), we get for t = t, 


)* 1g, (15.31) 


„1 


2 «otf 


In many cases, g(x,) = 0. Then, we have 


fao 20+) 
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B. ш 


Solve numerically the wave equation 
fax, 0 = Afal, 0), АЕ] 


with the boundary conditions 
fOt)=0 and f(6,t)=0 


and initial values 
f(x, 0) = f(x) = х(5 - x) 


x(5 =x) 
Equation (15.1) 
Equation (15.30) 


4 
and assuming 
та. 
1-4 43^ 0 
We get, 
e 
m 
The values estimated using Eqs (15.30) and (15.31) are tabulated in 
Table 15.4. 
Table 15.4 
pm 0 i 2 3 4 5 
oo оо 4 #6 6 4 — 00 
05 00 3 5 5 3 0.0 
10 0.0 1 2 2 1 0.0 
1.5 0.0 -1 -2 -2 -1 0.0 
20 00 3 -5 -5 -3 0.0 
2.5 00 -4 -6 -6 -4 0.0 


WE SUMMARY 


In this chapter, we discussed the solution of an important class of differ- 
ential equations called partial differential equations. Due to complexity 
and limited scope of this book, we considered only the finite-difference 
method of solving the PDE problems where the coefficients a, b, and с 
are constants. We presented the following in this chapter: 

• Definition and classification of partial differential equations. 


* Derivation of difference equations for PDEs. 


» Solution of Laplace's equation by *-- method of elimination. 


Sol 
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Liebmann's iterative method for solving Laplace's equation. 
Solution of Poisson's equation by bath direct and iterative methods. 
Solution of parabolic type heat equation using the explicit Bender- 
Schinidt recurrence equation and the implicit Crank-Nicholson for- 
mula. 

Solution of hyperbolic type wave equation by iterative procedure. 


[ Key Terms 


Bender-Schmidt equation Hyperboli equation 
Crank-Nicholson formula Implicit method 

Cross average 1аріасе'ѕ equation 
Diagonal average Laplacian operator 
Dirichlet boundary condition Liebmann's method 
Elliptic equation Parabolic equation 

Explicit method Partial derivatives 
Finite-difference method Partial differential equation 
Finite-alement method Poisson's equation 
Gauss-Seidel iteration Wave equation 


Heat equation 


. What is diagonal-averaging? When do we use it? 
. Derive a difference equation to represent a Poisson's equation. 
. Derive the five-point formula for Laplace's equation. 


‚ What is a partial differential equation? Give two examples. 
. State two real-life problems where partial differential equations 


are required to construct mathematical models. 


. Ном are the partial differential equations classified? Give an ex- 


ample from real-life situations for each type. 

What are the various methods available to solve differential equa- 
tions? 

Explain how difference quotients are applied to solve partial differ- 
ential equations, 


. What is Poisson’s equation? How does it differ from Laplace's equa- 


tion? 


. What is Liebmann's iteration method? What arc its advantages? 


What is meant by Dirichlet boundary conditions? 


What is Crank-Nicholson method? Why is it known as implicit. 
method? 


. What is Bender-Schmidt recurrence equation? Derive the formula. 
. Discuss the impact of size of the incremental width AT for the time 


variable t on the solution of а heat-flow equation. 
Outline the argument that demonstrates the stability of the finite- 
difference procedure for solving a hyperbolic equation. 


= 
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‚ Determine which of the following equations are elliptic, parabolic, 


and hyperbolic. 
(а) 35, + 4f, =0 
Ы) fa- hy = 0 
(©) fa Yoy + 25, = 2x + Бу 
(d) f, + fy + 4 fy = 0 
(е) fy-f,=0 
(D fa + fy «95, =0 


+ The steady-state two-dimensional heat-flow in a metal plate is by 


er * eT 

dx? ay? 
Given the boundary conditions as shown in the figure below, find 
the temperatures T, T», Ty, and Т,. 


0 5 100 100 


Solve for the steady-state temperatures in а rectangular plate 8 cm 
X 10 cm, if one 10 cm side is held at 50°С, and the other 10 em side 
is held at 30°C and the other two sides are held at 10°C, Assume 
square grids of size 2 em x 2 cm. 

Repeat Exercise 3 by Leibmann's method. 

Evaluate f(x, y) at the internal grid points of the given domain 
governed by Laplace's equation. Use Liebmann's iteration method. 


pp... 0. 100 o r——9 2» 9 
er—3-— 1——150 201—4 — 120 
0|— +t 00 50 50 
0 ——| = 150 20 20 
0 0 0 0 10 0 20 50 30 0 


10. 


11. 
12. 
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„ Torsion on a rectangular bar subject to twisting is governed by 


WT=-4 


Given the condition T = 0 on boundary, find T over a cross section 
of a bar of size 9 em x 12 cm. Use a grid size of 3 cm x 3 cm. 


‚ Solve the equation 


V*fz Р(х,у) 
with F(x, y) = xy and f = 0 on boundary. The domain is a square 
with corners at (0, 0) and (4, 4). Use h = 1. 


. Estimate the values at grid points of the following equations using 


Bender-Schmidt recurrence equation, Assume В = 1 
(a) f, —0.5f, = 0 j 
Given, 
РО, t) = 0, (5, ) = 0 
f(x, 0) = x5 — x) 
(b) Ifo 
Given, 
flO, 0 = — 5, 5,0 = 5 
[-8forü0sxs2.5 
fe. 0| stra5«x«5 


. Initial temperatures within an insulated cylindrical metal rod of 5 


cm long are given by 
Т = 20x forOsx<5 
where x is the distance from one face. Both the ends are main- 
tained at 0°C, Find the temperatures as a function of x and t if the 
heat flow is governed by 
4T, -T,=0 
Solve the following equation using Crank-Nicholson method. 


OF gf 
ax? at 
Given, 
f(0,020, А20, 0 = 10 
fx, 0) = 2.0 
Assume Ах 2h =5 andr=1 
Solve Exercise 9 with the Crank-Nicholson method with г = 1. 
Solve the following hyperbolic equations using finite difference meth- 
od. 
(a) fu = fa 
Given, 
ГО, t) = 0 and fí5, t = 0 
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f(x, 0) = 100x? (5 – х) 
f.(x, 0) - 0 
) fa = 4f. 
Given, 
РО, 0 = 0 and f(1, #) =0 
f(z, 0) = f(x) = sin (xx) + sin (2лх) 
f,(x,0) = 0 


1. Develop a program to solve Laplace's equation with Dirichlet condi- 
tions. 

2. Write a program to solve Poisson's equation. 

3. Develop a program using forward-difference method to solve the 
heat equation. 

4. Write a program to solve the heat equation using Crank-Nicholson 
method 

5. Write a program for finite-difference solution of the wave equation, 


Solution of Linear 
Systems by 
Matrix Methods 


E OVERVIEW OF MATRICES 


A matrix is a rectangular array of elements arranged in rows and col- 
umns, If the matrix contains m rows and n columns, the matrix is said 
to be of size m x n. The element in the ith row and jth column of the 
inatrix A is denoted by ay. For example, 


fou аш ew] 
anjon aa ae 


|аз аз аз 


[йаа 4a ав) 
is a 4 x 3 matrix. 


Types of Matrices 
Matrices may belong to one of the many types discussed here. 
Т. Square Matrix A matrix in which rows m is equal to columns n. 


2. Identity Matrix A square matrix in which all the diagonal elements 
are one and other elements are zero. That is 


for i=j 
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Identity matrices are denoted by I. For example, a 3 x 3 identity is 
written as 
10 0) 
1-10 10 
0 0 1| 


3. Row Vector A matrix with one row and n columns. 
4. Column Vector А matrix with m rows and one column. 


5. Transpose Matrix The matrix А” is called the transpose of A if the 
element a, in A is equal to element a; in А? for all i and j. For example, 
if 


then, 


It is clear that A" is obtained by interchanging the rows and the columns 
of A. 

6. Zero Matrix A matrix in which every element is zero. 

7. Equal Matrix A matrix is said to be equal to another matrix if and 
only if they have the same order and the corresponding elements are 


equal. That is, 
matrix A = matrix В 


if ay = by for all i and j. 


Matrix Algebra 
We can perform only three operations, namely, addition, subtraction and 
multiplication on the matrices. The division, although not defined, is 
replaced by the concept of inversion discussed later. 

Two matrices A and B can be added together (or subtracted from each 
other) if they are of the same order. Then 

sum C=A+B 
can be obtained by adding the corresponding elements, That is, 
oy = ay + by for all i andj 
Similarly, 
difference E=A-B 

can be obtained by subtracting the corresponding elements. That is, 


ey=ay—by for alliand? 
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Two matrices A and B can be multiplied in the order AB if and only if 
the number of columns of A is equal to the number of rows of B. That is, 
if A is of order m х r, then B should be of order r x n, where m and n are 
arbitrary values. In such cases, we may obtain 

P-AB 
which is of order m x n. Elements of the matrix P is given by 


ру = Laub, — for alli andj 
hol 


‘The following general properties apply to matrix algebra. 
1 АкВ=В+А 
2. А+ (В +С) = (А +В) +С 

3. (А + В = AT x BY 

4. IA = AL, where I is identity matrix 

5. (AB)C = A(BC) 

6. C(A+B)=CA+CB 

7. (A+ B)C = AC + BC 

8. a(AB)=(aA)B=A(aB), where a is a scalar 


Traces and Determinants 


The trace of a matrix is the number obtained by adding its diagonal 
elements (from upper left corner to the lower right corner), For example, 
the trace of the matrix 


[аи аш аз 
А=|а аф азд | 18 Gy, + Gon + 135 
аз аз аз; 
The determinant of а 2 x 2 matrix, say A, is written in the form 
lan аш 
an G29} 
and the determinant is computed as 
ац 02; — 032 ац 


Similarly, for a 3 x 3 matrix the determinant is given by 


би an ац 
аз йз az 
@зз аз; азз 


lA] = 


an аз 
|a; O32 


[= talann teles 


=ац -012 
as aa зї Gag 
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= а, (922038 — 02808) ~ азаазз ~ 25:025) 
+ ау(ащ@з; — 04223) 
= 011431033 + Giy mo- ` 213021032 — 011082023 
7 42112033 7 131023013 
Note that there are 6 product terms added together. For larger matrices, 
the determinant is much more difficult to define and compute manually. 
In general, for an п x n matrix, the determinant will contain a sum of n! 
signed product terms, each having л elements. 

Some of the important properties of determinants that would be help- 

ful in computing their values are: 

1. Interchanging two rows (or two columns) of a matrix changes the 
sign of the determinant but not the value. 

2. If two rows (or columns) of a matrix are identical then its determi- 
nant is zero. 

3. If a matrix contains a row (or column) of all zeros, then its determi- 
nant is zero. 

4, Value of the determinant of a matrix does not change when a 
scalar multiple of one row (or column) is added to another row (or 
column). 

5. If every element of a row (or column) is multiplied by a scalar u, 

the value of the determinant is multiplied by o. 

. IfA and B are square matrices of same size, then [АВ | = |A| |B]. 

. For a triangular matrix (in which all the elements below (or above) 
the diagonal are zero), the determinant is the product of the diago- 
nal elements. 


Minors and Cofactors 


The minor M, of the element a, of the determinant |A| is obtained by 
striking out the ith row and jth column. That is, 


а; а а 
My = а м apa 13 
3 E азу 2 ра аз 
and so on. That is, the minor of а particular element is the determinant 
that remains after the row and the column that contain the element 
have been deleted. 


The cofactor of an element is its minor with a sign attached. The 
cofactor dy of an element a, is given by 


d, = 1f Mj 
The value of the determinant of a matrix can be obtained by expanding 
the determinant by cofactors. This is done by choosing any column or 


row and determining the sum of the product of eac element in the 
chosen row or column and its cofactor. 


ле 
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Adjoint Matrix 


If d; is the cofactor of the element а, of the square matrix A, then, by 
definition, the adjoint matrix of A is given by 


adj(A) = D7 
where 
dy dig .. dy] 
Re du da "T ч 
dm Фа =. dan 
Inverse of a Matrix 


If B and С are two n x n square matrices such that 
BC = CB = I (Identity matrix) 


then, В is called the inverse of C and C the inverse of B. The common 
notation for inverses is B^ and C '. That is, 


BiB-I 
C'Csl 


The concept of matrix inversion is useful in solving linear systems of 
equations. 


SOLUTION OF LINEAR SYSTEMS BY 
DETERMINANTS 


We can solve a system of linear equations by determinants using a 
method called Cramer's Rule. For the sake of simplicity, we consider a 
2x 2 system such as 


ax, + bx =c 

dx, +ех =f 
We can solve for the variable x, by eliminating the variable a 
Thus, 
ce-bf. 


ae — bd 


Similarly, 


af - cd 
9-2 
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Alternatively, we can express x, and x, using determinants. That is, 


c bl а с 

T ® 
x= and х= 

а bl а bl 

а е d е 


This is known as Cramer's rule 


BH EM 


Solve the following system of equations using Cramer's rule: 
2x, + Bx, = 12 


р 1 

pe at an. 

TOM S -M 14 
k 


А SOLUTION OF LINEAR SYSTEMS BY 


MATRIX INVERSION 


A linear system of n equations in n unknowns can be represented in 
matrix form as 


AX=B 
where A, X and B are matrices and are given by 
ац аш e аһ | 
A-|em em - an | 


Әп a2 o а) 
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x by 
x-|?| вә 5 
xs] b. 


A is called coefficient matrix and B is known as constant vector. X is 
the required solution and, therefore, it is called the solution vector. If we 
multiply the matrix equation 

AX-B 
By A™ on both sides, we get 
АЛАХ = АСВ 
АС! is the inverse matrix of А. We know that A7! A = Г is the identity 
matrix and is given by 


| 100 0 
010 = 0 
fafo 0 1 0 
000. i 
Therefore, 
АЛАХ = IX=X 
is the solution of the system of equations and is obtained from 
X-A!B-CB 


If we know the inverse of the matrix A, we can obtain the solution vector 
X by post-multiplying it by B. 

‘The inverse A^ of a square matrix A exists, if and only if, A is 
nonsingular (i.e. det A #0). A” is the matrix obtained from A by replac- 
ing each element a, by its cofactor а, and then transposing the resulting 
matrix and dividing it by the determinant of A. 


(А) 

С=А = adj 

a det A 

where adj(A) is the adjoint of matrix A and is given by the transpose of 
the cofactor matrix of A. 

Then 


where 
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dy is the cofactor of the element a, and is given by 
d, = (-1/9 M, 


M, is called the minor of ay and is taken as the determinant of matrix A 
after deleting ith row and Ah column. 


Solve the following system using matrix inversion method 
2х tag +ху= 7 


-җ+=0 
Ax, +2х,-3ху=4 
[21 1 T 
Given, А=|1 -1 and В=/0 
l4 2 -s 4 
м. -fA 4-1 алеї 
i I 
Mas M--7 du=7 
1 2j 
Ms " Яз = 
Ma =|} ar 5 din 
Mas dn 
2 
My Л J 0 daz0 
1 у 
Mal +2 dy =2 
2 
и, = ]-i dasi 
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i 5 ж 
15 15 15 
7-0 -1 
c ьС=|-1- = 
Ж 8 15 15 15| 
6 -3 
15 9 1 | 


We know that X = СВ and therefore 


240-81 


х=49 -0. 4-8 


15 
242,9 12. 
SN MET UU 


БАЯ GAUSS-JORDAN MATRIX INVERSION 


The Gauss-Jordan elimination technique (discussed in Chapter 7) can be 
used to invert a matrix effectively. The square matrix A which is to be 
inverted is augmented by a unit matrix of the same order as follows. 


an ай оф {10 a € 
аз 02 аһ |0 1... 0 
On) Qn? Onn lo 0 1 


If we carry out Gauss-Jordan elimination using the first row as a pivot 
vow we get 
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Ж Gy = Gy аат 0 = 0 
0 an .. 02 | Gener 1 -~ 0 
@ tag sce йв [бин Ё s E 


When we repeat the process using the second row as the pivot row, the 
result is 


1 0 аз . Ow |@а+1 diae O 0 
0 1 аз аз |аъазї @зл+: 0 0 
0 0 азз .. аз | азл @зп+: 1 0 
0 0 ans o аһ (Annei aas ©... L 


This elimination process, if continued for all the n rows, yields the final 
result as follows: 


10 0..0 

Ainsi Gins? c @щп+л 
0 1 0 .. 
00 1..0 Ogn.1 @2л+1 +. @2п+л 
ооо .. 1|%һл*1 Gnmnsi w Gunsa 


The matrix a; for i = 1ton andj 2 n +1 to 2n is the inverse matrix of A. 


1,....п 


"PUE 
ска = ta, { n+1,...,2n 


where the element 


[2 foizl.,n and Jd, vim. 


“алу 


8B 8 


F'ind the inverse of the coefficient matrix of the system given in Example 
A2 using Gauss-Jordan elimination technique. 


42 
Pivot row-1 
1 v2 y2| v2 0 0 
0 -3/2 y2|-v2 1 0 
0 0 -5|-201 
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Pivot row-2 
10 2/3]V3 V3 0 
0 1 -V3|V3 -2/3 0 
00 -]|-2.0 1 
Pivot row-3 


0 1 0/7/15 -2/3 -V15 


0 0|115 V3 2/15 
ba 1125 0 -15 


The last three columns represent the inverse of the matrix 


2 1 1 
1 ч 1 
4 2 -3 


Compare this with the result obtained in the previous section. 
Once the inverse of A is known, the solution vector X can be obtained by 
simple matrix multiplication. That is 

Х=А'!В=СВ 

ху = сибү Cub; +... cub. 

Xq = субу + су +... + Cab, 
and so on. 

In general, 
х= Ecjb; i=l uon 
ia 

Note that cy = а, , + ; of augmented A in the final step. 


Adjoint matrix 
Coefficient matrix 
Cofactor 

Column vector 
Cramer's rule 
Determinant 
Equal matrix 
Identity matrix 


Solution of 
Polynomials by 
Graeffe’s Root 
Squaring Method 


Graeffe's root squaring method is a direct method of finding roots of a 
polynomial with real coefficients. This method deserves attention both 
for its historical interest and a novel idea involved 

Graeffe's method transforms a polynomial p,(x) into another polyno- 
mial of the same degree but whose roots are the squares of the roots of 
the original polynomial. Because of the squaring property, the roots of 
the new polynomial will be spread apart more widely than in the о g 
nal one, when the roots are greater than 1 in absolute value Repeating 
this process until the roots are really far apa ме can compute the 
roots directly from the coefficients. 

We consider here a simple example to illustrate the root squaring 
technique. 


Let 


Pott) = (x ~ Whe ~ 2-3) (B. 
Then, we consider another function p,(y) such that 
piy) = =polx) рух) 


= (х - 1(х-2(х-3 - 10-х - 20-х - 3) 


x — 1x - 2x — 3x + 1Xx 20x + 3) 
={х®- 1x" - 4ух* - 9) 
1Xy- 4Xy- 
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where 

ysx 
We know that the roots of pix) = 0 are x, = 1, х, = 2, and x, = 3. And 
from Fa, (B.2) the roots of p;(y) = 0 are y, = 1, y = 4, and y, = 9. It shows 
that roots of p;(y) = 0 are the squares of the roots of руу) = 0. This 
implies that if we compute the roots of p,(y), then we can obtain the 
roots of рох) from 


х= ууз 


Now, let us repeat the procedure for finding the roots of p,(y), Consider 
a third polynomial 


P22) = -py() py) 


= (2- 1X2 - 16X2 - 81) (B.3) 
The roots of Eq. (B.3) are 
z=1(= yj) -4———— y,( =x?) «———_ x, 
22=16(= yj -— — — yí = х2) ————— x 


2y= 81 (= уф) —————— y,( = x3) -M——— M x, 
Iteration 2 Iteration 1 
That is, after the second iteration, we can estimate the roots of original 
equation рух) from the relation 
=z} =z}, i=1,2,3 

Suppose we have done the squaring process Ё times and the roots of the 
final equation are г, then 

(B.4) 


Remember that we never have рух) in factored form as given in 
Eq. (B.1), but the result is the same. 
Now, let us consider a third degree polynomial in standard form as 


рфх) = age! + ау? + ax + ay (B.5) 
Then, 
Pi) = -ру-х) р(х) 
= (ах? + арх? + a,x + ag) (аух® - apx? + aix - ау) 
= aĝa" - (а? — азау + (a? — 2agay)x* - ag? 


= bzy? + bay? + by, + bg (B.6) 
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where 
y= 
and 
b3=+a3 
b, = -(a} - 2ayas) 
b,- + (a? Заа) | вл) 
by = ~a,? 
We can thus show that for a general polynomial of degree n, 


рўх) = оз” a, ax +... + ay (B.8) 
after first squaring process, 


6,=+ a? 
b, 7 -(a? – да,а, 2) 


1 
ЕСИ] | 
з | 

| 


| (B.9) 
This process can be repeated replacing ‘a’ values by ‘b’ values in Eq. 


(B.9) each time. Let us suppose that our final equation after & iterations 
(i.e. squaring times) is 
Ву +В, у”! +o + Ву=0 (в.10) 
Assuming that the roots of Eq. (B.10) are now more widely separated, 
we have 


by = 4-17 аў 


ly l>>Ly21>>1¥a] >> In 


Then, 
"Eg 
ЕД B, 
Baa 
У [raw 
PRI 
У. * в, 
That is, 
Bai 
rm PSY ЖЕ 
В, isi 


=(z)" (B.11) 
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B, | 
x, = Pth root of y, = 24 Ё | (B.12) 


The mi^ Avantage of Graeffe's root squaring method over other meth- 
ods is th. it does not require any initial guessing about roots. The 
method is also capable of giving all the roots but the limitation is that 
the polynomial should have only real coefficients. 


EN 1 —. 


Apply root squaring technique to estimate the roots of 
x-a- бу +8=0 


Table below shows the coefficiente of successive polynomials (using Eq. 
(B.T) ав well as the roots estimated (using Eq. (B.12). 


Thus, 


k | Coefficients ДЕ Roots estimated 
о ae ON а | а аа 

0 1 -3 -6 8 

EE -21 84 -64 | 45826 20 0.8729 

2| 1 -23 4368 —4096 | 4.0648 20 0.9841 

3 | 1 -65793 16843008 -16777216 | 4.0020 20 0.9995 


The exact values are 1, -2, and 4. Signs must be determined by substi- 
tuting the estimates in the original polynomial. When we substitute an 
estimated root, if the value of the polynomial is zero, then the root is 
positive; otherwise negative. 


Difference Operators 
and Central Difference 
Interpolation 
Formulae 


БЕТ INTRODUCTION 


In Chapter 9 we bave already discussed briefly the application of finite 
differences for interpolating the function values, Here we consider again 
the finite differences of functions and discuss in detail various operators 
used on them. We also discuss here some of the central difference formulae 
used for interpolation. 


EA FINITE DIFFERENCES 


Suppose we have a function, f(x), whose values are known (or tabulated) 
at a set of points xp, x}, Xa, ..., x,. Let us denote the function values f(x;), 
fixi), s (Gq) by fos Fis ~ fn- The difference between any two consecutive 
function values is called the finite difference. The difference in function 
values 


fei. - Ka) firii (C.1) 
is known as the first forward difference at x = x, We may denote this 
first difference at x = x, as 

P 


(С.2) 


where A is an operator called the forward difference operator. 
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By applying the proces: repeatedly we generate the second forward 
difference, third forward difference, and so on. Thus the kth forward 
difference of Дх) at x = x, is given by 


Га марада (Сз) 


this isat 3161-7 


The difference in fi nction values 


fix) - fa) ба (C.4) 
is known as the first backward difference at x = x,. This is denoted by 
Vh -fi fii] (C.5) 


where V is an operator called the backward difference operator. 
The kth backward difference of Дх) at x = x, is defined by 


PS oO a ENERO 
(V e v7 i a] (C8) 


ky 
Forward and backward differences introduce asymmetry, Sometimes we 
may need formulae which are symmetrical about the points of interest. 
Such formulae are based on central differences. 
The difference in function values 
А сл) 
is known as the first central difference of f(x) at x = x,. This is denoted by 


where 6 is called the central difference operator. 
The Ath central difference at x = x, is defined as 
Of = 88) = 85775, os сэ) 
Note that if the function values are only known at х, x», ...., x,, then 


Fa, , 1) is indeterminate and therefore 6f(z,) is not computable. However, 
we can evaluate ó^f(x,) as follows: 


S = M .з—, лаз) 
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= а-Йй-Ё+Ё-ї 
ahei- Atha (C.10) 
In general, we can compute ô”f, for all positive integers k. 


ES DIFFERENCE OPERATORS 


We have seen three difference operators, namely, forward operator A, 
backward operator V, and central operator 6. We consider here some 
more operators that are often used in the manipulation of interpolation 
formulae. 


Shift Operator 


We have an operator known as shift or displacement or translation 
operator denoted by E. The displacement or shift operator is defined as 


ifia] сс.) 
Ef =й. (C.12) 
If the values of x, are equally spaced such that 


PNLITIT 
then 
xim 
Ец. (C.12) may be written as 


E*fix,) = fix, + kh) (C13) 
Inverse Operator 
An operator opposite of E is known as inverse operator and is defined as 
E^ f(x) = gx) (0.14) 
Note that 
Egíx) = EE! f(x) = fix) 
Egix) = g(x + h) 
Then 
glx + h) = fix) 
Therefore, 
g(x) = fæ- h) 
That is, 
E“f(x) = fix — №) (C.15) 
Similarly, 


E*f(x) = f(x — kh) (С.16) 
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Averaging Operator 
The everaging operator | is defined as 


а= Чап +һл®) (C30) 
That is, 


и) - Mr er -4)] 


per RELATIONS BETWEEN THE OPERATORS 


The difference operators are related to one another in a number of ways. 
We consider here a few of them. 
Relation between A and E 


We know that 
Af) = fix + h) - (Дх) 


= Еро) - LfG) 
-2(E- Df) 
Therefore, 
(C.18) 
Relation between y and E 
Vf) = fix) - f(x - h) 
= 1/(х)- E^ f(x) 
=(1-Е!)/(х) 
Therefore 
У s (C.19) 
or 
Е=(1- уу! 
(since (E 3) = E) 
Relation between A, v and E 
We have 
EY = E1- E) 
=E-1 
=A 
Therefore чи 
EV (C.20) 


Similarly, we can show that 
E'A-V 
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Relation between Е, 6 and ш 


We have 
а= (s t)r) 
= E f(x) – EC? f(x) 
"Therefore, 


ЕЗД =, (С.21) 
Similarly, we сап show that 
| nemen i= 2 "i 
и= gs Е | (€.22) 
From Eqs (C.21) and (C.22), 
uó- 1 (Е - E» 
We know 
A*V-sE-Düu-E'sE-E' 
Then, 
иё= 1(А+ 7) 
2 


A-Vs(E-D-ü- E 


2E-24E! 
AV =(E-1)(1-E") 
=E-2+E" 
Therefore 
A-VzAV 
| Example C.3 | 


Show that A? = Е? - 2E + 1 


A? = (Е- 1)(Е-1) 
= Е®-2Е +1 
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A-V=(E-1)41-E") 


=Е+Е!-2 
Hence, 
F=a-V 
From Example C.2, we know 
A-V=AvV 
and, therefore, 
P=AV 
From Example C.3, we can prove that 
А = Е? - 2Е +1 
= Е(Е- 2+ Е) 


= Е(А- У) = ЕАУ 
Similarly, the readers may attempt to prove the following relations 


L weaned ог §2=4(u2-1) 


=A+V 


[Б] CENTRAL DIFFERENCE INTERPOLATION 


FORMULAE 


We have already discussed in Chapter 9 the forward and backward 
difference formulae. They are useful when the value required is at the 
beginning or at the end of the table. However, if the point of interest is 
located at the middle of the table, then we may use the formulae based 
on central differences. 
Let x; be the middle point and fù the corresponding function value. 

Consider the values of x on either side x, such that 

f, = [же +h), f1 = fito- h) 

fa= fis + 2h), fa = [ощ - 2h) 
and so on. 
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We can now form a difference table with the values f(x) on either side of 
ху, as shown in Table C.1 


Table C.1 Central difference table 


x f First Second Third Fourth 
difference difference difference. difference 
xy 2h fa 
Аба = Uf sy 
t-A fa NAT. 
2 Afi = oF we ШЫЛА 
а h Afa = 89 А102 = 61, 
А/ = ohn Ац = Sfin 
х+А f Аа = Fh, 
АЛ = бм, 
w+ R 


The entries in the table are related using the relation between A and б 
operators. We know that 
ô= ap” 
and therefore we can show that 
Sf. = (AE T fym A? кз, = Afa 

Similarly, we can prove for all the entries. 

We present here the following central difference interpolation formulae 
which use differences close to the centre of the table. 

1. Gauss forward formula 

2. Gauss backward formula 

3. Stirling formula 

4. Bessel formula 

5. Laplace-Everett formula 


Gauss Forward Interpolation Formula 


We have used the Newton-Gregory forward interpolation formula in 
Chapter 9. This is given by 


- - -3) 
FG) fy + pafo + PEED жар, РОСО) ар, (оза 
where 
Xo*ph ог pz* = 


We can show that ' 
А = MEL = А21 + Ду = a4, + a, 
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Sf = AEF, = АФ (1 + Ау, = Afa Mff, 

Ай, = АЧ, + A f, апі so on. 
Substituting for Лу, Afo, .... * :е above equivalents in Eq. (C.23) and 
after simplification, we obtain 


fief. BL (8 Jer; {> 3 era (Para 
| (C.24) 


where 


pem moe 
a]. n! 


Equation (C.24) is known as Gauss forward formula. Note that this 
formula involves 'odd' differences below the centre line and 'even' differ- 
ences on the central line. 

Gauss Backward Interpolation Formula 


Gauss backward formula is obtained using the odd differences above the 
centre line and even differences on the central line. This is obtained by 
using the following substitutions in Eq. (C.23) 


АБ = Af, + Mf, 
Ар = Af, + NE. 
Ау OF + Af, 
А = Afa + Аа 
Аа = Аа + VI 
Thus, we obtain the Gauss backward interpolation formula as 


пољ (Pars (P2 Dara (Ps anra (P2? (ША 
i 
(C.25) 
Equation (C.25) is popularly known as Gauss backward formula. 


Stirling Formula 


Stirling formula is obtained by taking the average of the two Gauss 
formulae. Therefore, adding Eus (C.24) and (C.25) and dividing by 2, we 
obtain 


fG s fo + аа) ыу, 


2 


„РШ? =з +АЗ Р )+ р?(р% -1) 
2 4! 


ЕП Ара +. | (C.26) 
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Equation (C.26) is known as Stirling formula. Note that Eq. (C.26) in- 
volves the means of the 'odd' differences just above and just below the 
central line and ‘even’ differences on the line. To use this formula, p 
must satisfy the condition 


Bessel Formula 


Bessel formula is a modified form of Gauss forward formula, which is 
given below. 


) ( 
8 1 р+1)р(р- Df, 


F(x) = fo + pfo +. Af T - (C.27) 
We know 
Afy-fo-fo and, therefore, fy =f; - Afo (C.28) 
Ад = АЈА, then, А = А AY. (C.29) 
Similarly, 
‘fa = Afa Afa (€.30) 
Now, we rewrite Eq. (C.27) as 
г< fl, påfs + +120720 өүү, duds pu, 
2 3 
sete D MM es (сз) 
Now, substituting Eqs (C.28), (C.29), and (C.30) in (С.31), we get 
f jf, А-А sie ЛА ong, + ign afa 
" D nf, Авр TR Bur 
- 2 2 
-rh «(р- 2) VERD D (A? fo P fo) 
(нев 
pu em m UM Esas (C.32) 


Equation (C.32) is called Bessel formula. Note that the Bessel formula 
involves odd differences below the centre line and averages of the even 
differences on and below the central line. 
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Observe that when p = i all odd order differences vanish and we get 


2 2 4 
= 0+ 0-5 (EET fxs Bj. rera fra) 


5 (Sf. ofa 
а (C.33) 


7 1024 2 
Equation (C.33) is known as formula for interpolating to haves. 


Laplace-Everett Formula 
Again consider the Gauss forward formula 


(p*Dpp-D 
FG) fo + pMo + eee 


if we eliminate the add dili from Eq. (C.34) the result will give 
us the Laplacc-Everett formula. We snow 


mo- D aari AS +... (0.34) 


А=ћ-А 
Ау = 3% fa 
Af am АЧ Аа 


and во оп, 
Substituting these in Eq. (C.34) and rearranging the terms, we get 


коа аљ «(7 н) аз fa Pot) sary 


ТА (PA) жул» (C.38) 


Letting 
p=l-q or l-p=q 
and simplifying, Eq. (C.35) becomes 


fc [ws «(75^ Jota +. } 4 ( PULLS (C.36) 


Equation (C.36) is known as Laplace-Everett formula. This formula in- 
volves only even differences on and below the central line. This can be 
used when 0 « p « 1. 
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Averaging operator Forward difference operator 
Backward difference Gauss backward formula 
Backward difference operator Gauss forward formula 
Bessel formula Inverse operator 

Central difference Laplace-Everett formula 
Central difference operator Shift operator 
Displacement operator Stirling formula 


Forward difference Translation operator 


С Programs 
————— 


Bas INTRODUCTION 


C programs are basically made of functions. As we know, a function is a 
unit of a program that performs a particular task. A C function is similar 
to a subroutine in FORTRAN, A function begins running when its name 
is used in the program. 

АП C programs need a main function which is named as main() 
Therefore, the programs in this appendix begin with the main function. 
Since the main function does not use any arguments, its name is followed 
by a set of empty parentheses. 

Everything inside a C function is enclosed between two eurly brackets, 
like {...). The left curly bracket signals the beginning of the function and 
the right one signals the end of the function. 

Some features of C that are distinctly, different from FORTRAN are 

* C is a free-form language and therefore we may follow any forms 
of statements that may please us. 

A comment or remark can be inserted anywhere that a space can 
appear in a C program provided it is preceded by /* and followed by 
sj 


* As a convention (and not as a rule), everything in C is written 
using lower-case letters. Thus, names of all variables and functions 
are written in lower-case. 

АП C statements must end with a semicolon. 

C does not support statement numbers. However, we can assign a 
label to a statement as follows: 


begin : sum = 0.0; 
We may direct the control to this Statement by using the goto 
statement as 
goto begin; 
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С language does not have STOP and END statements. While the 
closing bracket } represents the “END” of a function, STOP may be 
replaced by invoking the exit() function available in C. 

Unlike FORTRAN, all variables and functions that return values 
must explicitly be declared for their types. 


int i,j,a,b; /* declaring integer variables */ 
float x,y,a[10]; /*declaring real variables */ 


C supports what are known as preprocessor directives #include 
and #define. The include directive is used to include in the 
program library functions and other files, We may use the #define 
directive to define constants and functions that are used in the 
program. 
C does not support any operator for exponentiation. Instead, it uses 
a function pow(x,y) to compute x”. 
C uses square brackets to represent arrays variables, like, ali]. 
ЫЛ), ete. 
Unlike FORTRAN, C array elements are numbered from ZERO, 
not ONE. That is, the array a[3] will be represented in the memory 
as a[0], a[1], and a|2]. 
C uses an operator & known as address operator to read the values 
from the keyboard. We may use this operator to obtain the address 
of a variable in the memory. For example, & x gives the address of 
the location of x. 
C defines % as the modules operator. This operator divides the first 
operand by the second and gives the remainder, (not the quotient) 
ав the result. 
C defines another operator known as "star" operator * to define a 
pointer. The statements 

int a; 

int *p; /* p is declared as pointer */ 

р=&а; /* address of a is assigned to p */ 
make p point to а: Now, the statements *p45 and a45 both would 
give the same result. That is, *p means the value stored at the 
address pointed to by p. 
C uses the following relational and logical operators: 

== is equal to 
< is less than 
> is greater than 
is less than or equal to 


2 
>= is greater than or equal to 
!= is not equal to 
i 
& 


1 logical OR 
logical AND 


С Programs $33 


Chas two more contro] statements continue and break in addition 
to the conventional goto statement. These two are used inside a 
loop. While the break transfers the control to the first statement 
outside the loop, the continue statement skips the remaining part 
of the loop. 


Bz FORTRAN TO C CONVERSION 


The FORTRAN programs in the text have been translated almost line 
by line using C equivalents given in Table D.1 


Table 0.1 С equivalents 


FORTRAN Statements C Equivalents 
INTEGER Tint Е 
REAL float, double 
PARAMETERS (M = 100) #define M 100 
INTRINSIC #include <math.h> 


READ(*,*) statement 
WRITE (*,*) statement 
Function subprograms 


scanfl) statement 
printf) statement 
Preprocessor macro definition 


Statement number Label name 
GOTO <number> goto «label» 
GOTO inside a loop break or continue 
STOP inside a program exito funetion 

IF (...) THEN.. MC sass 
IFC.) THEN ... ELSE if G3 
DO ... CONTINUE for C.) 
END ) 


The preprocessor directives define and #include are always placed 
before the main() function. Any constants and variables declared before 
the main fanction are considered to be global and are available to all th 
functions defined after that point in the file. 

It should be noted that the programs have been translated line by line 
so as to enable the readers to understand the logic easily. It is quite 
possible to simplify or improve the efficiency of C programs in this 
appendix using some of the unique features of С. This requires a complete 
understanding of the language. 

Readers new to C language must read an introductory book on € such 
as Programming in ANSI C by E Balagurusamy before attempting to 
analyse and apply the C programs given in this appendix. 
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Table 0.2 List of programs 


Program No. Name Description 


І POLY Program POLY evaluates a polynomial of degree 
n at any point x using Horner's rule. 


2 BISECT This program finds a root of a nonlinear equa- 
tion using the bisection method. 

3 FALSE This program finds a root of a nonlinear equation 
by false position method. 

4 NEWTON This program finds a root of a nonlinear equation 
by Newton-Raphson method. 

5 SECANT This program finds a root of a nonlincar equation 
by secant method. 

6 FIXEDP This program finds a root of a function using the 
fixedp point iteration method 

1 MULTIR The program finds ull the real roots of a polyno- 
mial. 

8 COMPR This program locates all the roots, both real and 
complex, using the Bairstow method. 

9 MULLER’ This program evaluates root of a polynomial us- 
ing Muller's method. 

10 LEG! This program solves a system of linear equations 
using simple Gaussian elimination method. 

п LEG2 This program solves a system of lincar equations 
using Gaussian elimination with partial pivoting, 

12 DOLIT This program solves a system of linear equations 
using Dolittle LU decomposition. 

13 JACIT This program uses the subprogram JACOBI to 
solve a system of equations by Jacobi iteration 
method. 

14 srr This program uses the subprogram GASEID to 
solve a system of equations by Gauss-Seidel 
iteration method. 

15 LAGRAN This program computes the interpolation value 
at a specified point, given а set of data points, 
using the Lagrange interpolation representation. 

16 NEWINT This program constructs the Newton interpolation 
polynomial for a given set of data points and 
then computes interpolation value at a specified 
value. 

17 SPLINE This program computes the interpolation value 


at a specified value, given a set of table points, 
using the natural cubic spline interpolation. 


18 LINREG This program fits a line Y = A + BX to a given 
вет of data points by the method of least squares. 
19 POLREG This program fits a polynomial curve to a given 
set of data points by the method of least squares. 
20 NUDIF* This program computes the derivative of a tabu- 


lated function at a specified value using the New- 
ton interpolation approach. 


(Contd.) 


Table D.2(Contd.) 
Program No. * Name 


Description 


21 


30 


ТКАРЕІ 
SIMS1 
ROMABRG 
TRAPE2* 


SiMS2* 


EULER 


HEUN 
POLYGN 


RUNGE 


MILSIM 


This program integrates a given function using 
the trapezoidal rule. 

This program integrales a given function using 
the Simpson's 1/9 rule. 

This prog om porfarms Romberg integration by 
bisecting the intervals N times. 

This program ir’ зтоёсз а tabulated function us- 
ing the trapezi 
This program a tabniated function us- 
ing the Simpson's 1/3 rule. It the number of seg- 
ments is odd, the trapezoidal rule is used for the 
last segment. 

This program estimates the solution of the first 
order differentia! equation у' = fix, y) at the 
given point Euler's method. 

This program solves ihe first order differential 
equation у” = f(a, y) usine the Heun's method, 
This program soives the differential equation of 
type y' = Го, y) by polycon method. 

This program computes the solution of first order 
differential equation of type у“ = ftx, y) using 
the 4th order Runge-Kutta method. 

This program solves the first order differential 
equation y' = fix, у) using the Milne-Sim 
method. 


The 


Ж ттт programs in © and ere not available in FORT 
the text, Readers may try to develop FORTRAN er 


Program 1 POLY 


pee 


* Main 


ас any point X 


program 
Program POLY eval 


pnvalents of these prez 


uates а pi 
ing Hor 


rule 


* Subroutines used ^ 
* horner " 
ава ce asm Sici — * 
* Variables used i 


* = 


NIL 


Degree of 
Array of 
Pcint of 
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main( ) 

{ 
int n,i; /* Declaration of variables */ 
float x,p,a[10]; 
float horner(int n, float a(], float x); 


printf(*Input degree of polynomial, nin*); 
scanf("*$d*, &n); 
printf(*Input polynomial coefficients a(0) to a(n) \n*); 
Ёог(1=0;1<=п;1++) 

зсапЁ(*%#”, kalil); 
printf (“Input value of x (point of evaluation) Mnt); 
scanf("&f*, &x); 


/* Evaluating polynomial at x using Horner's rule */ 
р = horner(n,a,x); /* Calling the function horner */ 


/* Writing the result */ 
printf(*in*); 
printf(*f(x) = &f at x = %? Wn*, P. x); 
printf (*\n"); 


/* End of main program POLY 


/* Defining the function horner */ 
float horner(int n, float a[], float x) 


/* horner computes the velue of a polynomial of order n 
at any given point x. */ 


int i; /* Local variables */ 
float p; 


p = aln]; 
for(i=n-1;i>=0; 


‹ 


р = p*x + alil; 


1 
return (p); 
) 
/* End of Function horner */ 
ўж = 


Program 2 BISECT 
у+-----------------------------------------------=—- * 
* Main program E 
* This program finds a root of а nonlinear * 
* equation using the bisection method E 
T . 
^ Functions invoked * 
* Macro Р(х) + 
il SO нак ады м 
* Subroutines used i. 
+ bim * 
Л — . 
* Variables used » 
* & - Left endpoint of interval * 
* b - Right endpoint of interval = 
ы s - Status * 
en root - Final Solution kd 
^ count - Number of Iterations done ы 
* Constants used Y 
е EPS - Error bound * 
арлыгы TaN 


NA 


#iaclude 

fidefine EPS 0.000001 
define Е (х) (х) ч (ж) + (х) -2 
main( ) 


C 


int 8, count; 

float a,b,root; 

printf ("An"); 

printf("SOLUTION BY BISECTION METHOD \п"); 

printf£("\n"); 

printf (“Input starting values \n"): 

scanf("tf tf*,&a,&b); 

/* Calling the subroutine bim() */ 

bim(&a, &b, &root, &s, &count); 

if(s==0) 

t 
printé(*\n"); 
printf(*Starting points do not bracket any root \п*); 
ргіпце(“ (Check whether they bracket EVEN roots \п"}; 
print£(™\n" 
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else 

‹ 
printf("\nRoot = *f үп", root); 
printf(*F(root) = %£ \n*, F(root)); 
printf(*Wn*); 
printf(*Iterations = td Wn^, count); 
printf (*\n"); 

} 

2 


——!* nd of main program */ 


3" nous 


/* Defining the subroutine bim() */ 
bim(float *a, float *b, float *root, int *s, int *count) 


/* This subroutine finds a root of nonlinear equation 
in the interval [a,b] using the bisection method */ 
{ 
float x1,x2,x0,£0,f1, £2; 
/* Function values at initial points */ 


Xl = эг; 
x2 = *b; 
£1 = F(x1); 
£2 = Fix2); 


/* Tesi. if initial values bracket a SINGLE root */ 
if(fl*f2 > 0) 
t 
*s = 0; 
return; /* Program terminated */ 
} 
е1зе 
'* Bisect the interval and locate the root iteratively */ 
( 


*count = 0; 


begin: /* Iteration begins */ 
х0 = (xl + x2)/2.0; 
£0 = F(x0); 
if(f0--0) 
{ 
*з = 1; 
*Yoot = х0; 
return; 
) 
if(fl*f0 < 0) 
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{ 
X2 = x0; 
} 
else 
4 
xl - x0; 
fl - £0; 
} 


/* Test for accuracy and repeat the process, if necessary */ 
if (fabs ((x2-x1)/x2) < EPS) 


1 
*s = 1; 
*root = (x1«x2)/2.0; 
return; /* Iteration ends */ 
) 
else 
S 
#count = ‘count + 1; 
goto begin; 
H 
) 
) Я 
/* End of subroutine bim() */ 
нн —— „єў 
Program 3 FALSE 
dj йынан ннен lam Ra ase een Ree eRe * 
* Main program " 
» This program finds a root of a nonlinear equation 
Ж by false position method * 
РОР Me Dea SA ob itp i IES TROUT, ИНЕ . 
* Punctions. invoked * 
* — Macro F(x) E 
rmi —————— * 
* Subroutines used * 
* falt) * 
* * 


* Variables used * 
= a - Left endpoint of interval е 
* b - Right endpoint of interval * 
* s - Status * 
z root - Final solution ja; 
* count - Number of iterations completed * 


* Constants used * 
* EPS - Error bound * 
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finclude <math.h> 


fdefine EPS 0.000001 
fdefine F(x) (x)* (x) *(x)-2 
main( ) 

t 


int s,count; 
float a,b,root; 
printf (*\n*); 
printf("SOLUTTON BY FALSE POSITION METHOD \п*); 
printf(*in*); 
printf("Input starting values Wn*); 
scanf (“$f $f*, &a, &b); 
/* Calling the function fal() */ 


fal (&a,&b,&s8,&root, &count) ; 


printf(*in*); 

printf("Starcing points do not bracket 
any root \n"); 

printf("An*); 


3 
eise 
{ 
printf ("\n*); 
printf("Root = %Ё n*, root); 
printf (*F(root) = *f n*, F(root)); 
printf ("\n NO. OF ITERATIONS = 8d \п”, count); 
) 
) 
/* End of паїп() program */ 
^ ge Np ЕЕ —— на. Й 


/* Defining the subroutine fal() */ 
fal(float *a, float *b, int *s, float *root, int *count) 
/* fal finds a root of a nonlinear equation */ 


г 
float х1,х2,х0, #0, Ғ1, £2; 


хі = *а; 
х2 = *b; 
fl = Pixi); 
#2 = Е(х2); 


/* Test if a and b bracket а root */ 
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if(fl*f2 > 0) 


{ 

T 

return; /* Program terminated */ 
) 
eise 
t 

printf(* n xl x2 An"); 
) 
*count = 1; 
begin: /* Iteration begins */ 
х0 = x1-£1*(x2-x1)/(£2-£1); 
#0 = Р(х0); 
if(fi*f2 « 0) 
Í 

x0; 
£0; 

} 
else 
i 

xl - x0; 

fi = f0; 
} 


printf("$5d $15.6f $15.6f Xn", “count, xl, x2); 


/* Test whether the desired accuracy has been achieved */ 


) 


if(fabs((x2-x1)/x2) « EPS) 
{ 


accu 
*root (x1«x2)*0.5; 
return; /* Iteration ends */ 
ў 
else 
{ 
*count = *count + 1; 
goto begin: 


/* End of subroutine fal() */ 


7% = Ы; 

Program 4 NEWTON 

Gd жешнен ннан QM À— nen " 
*Main program * 


This program finds a root of a nonlinear equation 
by Newton-Raphson method * 


*Functions invoked Ы 


* 


Macros Р(х), FD(x), Library function fabs() * 


*Variables used И к 
x0 - Initial vale of x s 
xn - New value of x * 
* 
^ 


fx - Function value at x 
fdx - Value of function derivative at x 
count - Number cf iterations donc 


Constants used # 
EPS - Error bound я 
MAXIT - Maximum number of iterations permitted я 


include <math.h> 

define EPs 0.000001 
#define MAXIT 20 

#define F(x) (х)*(х)+(х)-2 
$define FD(x) 2* (х) +1 


main( ) 


{ 


int count; 
float x0, xn, fx, fdx; 


printf ("\n*); 

printf("Input initial value of x in"); 

scant("$f', &x0); 

printf ("\n"); 

printf(*" SOLUTION BY NEWTON-RAPHSON METHOD Wn"); 
printf (*\n*); 


count = 1; 

begin: /* Iteration begins */ 
fx = F(x0); 

fdx = FD(x0); 


xn = x0 £x/fdx; 


if(fabs((xn-x0)/xn) < EPS) /* Iteration ends */ 
t 
printf ("Root = %f \n*, xn); 
print£ ("Function value = %f \n", F(xn)); 
printf ("Number of iterations = Фа in", count); 
print£(*\n"); 
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е1ве 


х0 = xn; 
count = count + 1; 
if (count < MAXIT) 


{ 
goto begin; 

1 

else 

t 
printf("\n SOLUTION DOES NOT CONVERGE \n*); 
printf(*IN $d ITERATIONS Wn*, MAXIT); 

) 


) 


H 
/* End of main() program */ 


* Main program 


a This program finds a root of a nonlinear є 
* equation by secant method * 
йа giangan Ё 
* Functions invoked ы 
* Macro F(x) ‚> 
* * 
* Subroutines used * 
+ зес() s 
B 

* 

* + 
* b- Right endpoint of interval * 
* х1- New left point * 
* x2- New right point ыз 
* root- Final solution * 
* count - Number of iterations completed * 
Constants used + 
EPS - Error hound ы 


$include <math.h> 
#define EPS 0.000001 
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fdefine MAXIT 50 
*define F(x) (ж) * (x) + (х) -2 


main( 


t 


J 
/* Bnd of main() program */ 


) 


float a,b,root,xl,x2: 
int count,status; 


printé(*\n") ; 
printf(* SOLUTION BY SECANT METHOD \п*); 
printé("\n*); 

printf(*Input two starting points \п”); 
scanf(*$f $f*, ка, &b); 


Sec(ka, &b, &Xl, &x2, &root, &count, &status); 


if(status -- 1) 

{ 
printf ("\n"); 
printf (* DIVISION ЗҮ ZERO n"); 
printf(*MnLast xl = $Ë \n*, x1); 
printf(*YnLast х2 = $f in^, x2); 
printf(*WnNO. OF ITERATIONS = $d Wn*, count); 
printf ("\n*); 

) 

else if(status -- 2) 

{ 
printf ("\n"); 
Printf("NO CONVERGENCE IN &d ITERATIONS \n*, MAXIT); 
printf (*Ar*) ; 

) 

else 

{ 
printf ("\n"); 
printf(*Root = %Ё \n", root); 
printf (“Function value at root = èf \п”, Flroot)); 
printf(*in*); 
Printf("NO. OF ITERATIONS. = $d Wn', count); 
printf("in*); 

) 


dup 


/* Defining subroutine sec() */ 


sec(float *a, float *b, float *x1, float *x2, float 


*root, int *count, int *status) 


"n 


ye 


je 


ye 


n 
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This subroutine computes a root of an equation using 
the secant method. */ 


float x3,f1,£2,error; 


Function values at initial points */ 


*xl = жа; 
*х2 = *Ь; 
fl = Fl*a); 
f2 - F(*b); 


Compute the root iterativoly */ 
*count = 1; 


begin: /* Iteration process begins */ 
if(fabs(f1-f2) <- 1.E-10) 
{ 

“status = 1; 

return; /* Program terminated */ 


+ 
X3 = *x2 - f2*(*x2-*x1)/(f2-f1); 
error = fabs((x3-*x2)/x3); 


Test for accuracy */ 
if(error > EPS) 
á 


Test [or convergence */ 
if (*count MAXIT) 


{ 


*status = 2; 


return; /* Program terminated */ 
) 
else 
t 
*Xl = *х2; 
) 
= x3; 
Ё2: 
P(x3); 


*count = *count +°1; 
goto begin; /* Compute next approximation */ 


*root = xl; 
*status - 3; 
return; /* Iteration ends */ 
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) 


' 
/* End of subroutine sec() */ 


+ -+ 
* Main program 


^ This program finds а root of a function using * 
* the fixedp point iteration method * 
Желе" киен " 
* Functions invoked A 
* Library function fabs() and macro G(x) T 


* 


x0 - Initial guess 
x - Estimated root 


* 

* 

* Variables used 

* error - Relative error 


*Constants used 
* EPS - Error bound 
* MAXIT - Maximum iterations allowed 


"7911211 


4 include «math.h» 
# define EPS 0.000001 
# define G(x)  2.0-(x)* (x) 


main( ) 

б 
int MAXIT, i; 
float x0, x, error; 


printf (*\n SOLUTION BY FIXED-POINT METHOD \n"); 

print£("\n"); 

print£(*Input initial estimate of a root \n"); 

scanf("%£", &x0); 

printf ("Maximum iterations allowed \n"); 

Scanf(*$d", &MAXIT); 

printf(*Wn ITERATION VALUE OF X ERROR Mn"); 
/*Iteration process begins*/ 

for (4=1;i<=MAXIT; i++) D 

( 


x = 6(х0); 


Ж 
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4rror = fabs((x-x0)/x); 
printf("$10d 310.7f $10.7f \n*, i, x, error); 
if (error « EPS) 
goto end; *Iteration process ends*/ 
eise 
х0 = x; 
} 
printf("\nProcess does not converge to a root.\n"); 
printf(*Exit from iteration loop. Wn"); 
end: 


) 
/* End of main() program*/ 


Program 7 MULTIR 
m 2 2% 
* Main program * 
The program finds all the real roots of a polynomial * 
Fone * 
* Functions invoked T 
боз * 
FP crea —— нады —— М 
* Subroutines used * 
* newton() Е 
* dflet(]) "i 
Ф ж» * 
а 


Variables used i 
* n Degrec of polynomial = 
* a - Polynomial coefficients A(N+1) * 
* x0 - Initial guess ж 

хг - Root obtained by Newton method * 
* root - Root Vector Ў 
* status - Solution status ii 
* 


* Constants used 
EPS - Error bound 
MAXIT - Maximum iterations permitted 


t» 


#include <math.h> 
#define EPS 0.000001 
#define MAXIT 50 


void main( ) 
i 
int n, status, i, j; 
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float a[11], root[10], x0, xr; 


void dflat(int n, float a[11], float xr); 
void newton(int n, float а[11], float x0, int *status, 
float *xr); 


printf(*in*); 

printf ("\n EVALUATION OF MULTIPLE ROOTS \n‘); 
print£("\n"); 

printf("*Input degree of polynomial, n. n"); 
scanf(**td*, &n); 


printf("\nInput poly coefficients, а(1) to а(п+1). \n*); 
for(i = 1; i <= ned; is) 

scanf("*t*,  &ali] 
printf(*\nInput initial guess of x in*); 
scanf ("$f',  &x0); 
printf (*\n"); 


/* Process of root searching begins */ 
for (i=n;i>=2;i--) 
/* Find ith root by Newton Method */ 


newton(n,a,x0,&status,&xr); 


if(status == 2) 
{ 
for(j=n;j>=itl;j--) 
printf(*root $d = %f in*, 


zoot[j]); 


printf("\nNext root does not converge in Wi"); 
printf("$d iterations An", MAXIT); 
printf(*Wn*); 

goto end; /* processing ends */ 


) 
rootli] = ж; 


/* Deflate the polynomial by division (x-xr) */ 
dflatí(n,a,xr): 
х0 = xr; 
/* Proceed to next root */ 
) /* End of for loop */ 


/* Compute the last root */ 
root{1] = -а[1]/а[2]; 


/* Write results */ 
printf£("\n ROOTS OF POLYNOMIAL ARE: \n“); 
printf(*An*); 
for ( pi<=n;i++) 
printf ("ROOT $d = &f \n", i, rootli]); 
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printf (*\n"); 
end: 
printf ("END"); 
1 
/* End of main{) program */ 


/* Defining the subroutine newton() */ 


void newton(int n, float a[1ll], float x0, int *status, 
float *xr) 


/* This subroutine finds a root of the polynomial usiug 
the Newton-Raphson method */ 


int i, count; 
float fx;  /* Value of polynomial at x0 */ 
float fdx; /* Value of polynomial derivative at x0 */ 


count = 1; 
/* Compute the value of function at x0 */ 
begin: 
fx = а[п+11; 
for(isn;i =m} 
fx = fx * х0 + ali]; 


4* Compute the value of derivative at x0 */ 
fdx = а(п+1] * n; 
forlisnri?-2;i--) 
fdx = fdx * x0 + ali) * (1-1); 


/* Compute à root xr */ 
*xr =  x0-fx/fdx; 


/* Test for accuracy */ 
if (fabs ( (*xr-x0)/(*xr)) <= EPS) 
{ 
‘status = 1; 
return; 


} 


/* Test for convergence */ 

if (count <MAXIT) 

{ 
х0 = *xr; 
count = count + 1; 
goto begin; 

) 

else 

I 
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*status = 2; 
return; 


} 
/* End of subroutine newton() */ 


— ыныр ныр easi +, 


Prem 
/* Defining the subroutine dflat() */ 
void dflat(int n, float a[11], float xr) 


/* his subroutine reduces the degree of polynomial by 
one using synthetic division */ 
Y 
float b[11]; 
int i; 
/* Evaluate the coefficients of the reduced polynomial */ 
bine] = 0; 
for(i-n;iz-l;i--) 
bli] = ali«1] + xr * blisl]; 
/* Change coefficients from b array to a array */ 
for(i-l;i«-n«l;ies) 
afi] = Ыі; 
) 


/* End of subroutine dflat() */ 
/*- --- */ 
Program 8 COMPR 


* Main program ж 
* This program locates all the roots, both real * 
* and complex, using the Bairstow method * 


ынаа 


* Functions invoked * 
* NIL s 
F iip ESEO a etr riti * 
* Subroutines used Е 
* input, bstow, quad, output * 
* * 
* Variables used Ы 
* n - Degree of polynomial * 
* a - Array of coefficients of polynomial Ыы 
* u0,v0 - Initial values of coefficients of the * 
* * 


quadratic tactor 
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* Чу - Computedívalues of coefficients of the * 
* quadratic factor * 
* b - Coefficients of the reduced polynomial Ы 
*  xl,x2 - Roots of the quadratic factor # 
* type - Type of roots (real,imaginary or equal) * 
VPE CUN: o aao ORE epis, e ie aa * 
* Constants used * 
* EPS - Error bound А 
* 


include <math.h> 
#define EPS 0.000001 
#define image i 
#define equal 2 
#define unequal 3 


main( ) 
{ 
int n, i; 
float а[11], b(11], u0, v0, и, v, х1, x2, 40, dl, 
02, root, type, status; 


printf(*Wn*); 
print£(* EVALUATION OF COMPLEX ROOTS in"); 
printf(*"in*); 
/* бес input data */ 
printf("Input degree of polynomial, n \n"); 
scanf ("&d",&n) ; Р 
printf("W: Input coefficients а(п+1) to a(l) \n"); 
for(i=ntl1;i>=1;i--) 
scanf(^$f*,&a(i]); 
printf("\n Give initial values u0 and v0 \n"); 
scanf (“$f $&f*,&u0,&v0); 
begin: 
if(n > 2) 
{ 


/* Obtain а quadratic factor */ 


bstow(n,a[11],b[11],u0,v0, &u, &v, &status) ; 


if(status == 1) 
t 
d2 = 1; 
dl = -u 
do = -v; 
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} 


( 
printf("\n No Convergence іп 100 iterations in*); 


goto end; 

) 

/* Find roots of the quadratic factor */ 
quad(d2,d!,d0,&xi,&x2,&type); 


/* Print the roots */ 
output (n, type, x1, x2) ; 


/* Set the coefficients of the factor polynomial */ 
n = n-2; 
for (i=1; i<=n+1; i++) 
afi] = bli+2]; 


/* Set initial values for next quadratic factor */ 


u0 = u; 
v0 = v; 
goto begin; 
} /* endif */ 
if(n, == 2) /* polynomial is quadratic */ 


quad(al3],al2],a[1],&xl,&x2,&type); 
output (n, type, x1,x2) ; 


) 
else /* last root of an odd order polynomial */ 
t 
root = -а[1]/га[21; 
printf("\n"); 
print£("Final root = tf An", root); 
print£("\n"); 
) 
end: 


printf ("End"); 


/* End of main() program */ 


n 


Defining the subroutine bstow() */ 


bstow(int n,float a[11],float b[1i].float u0,float v0, 


n 


float *u,float *v,float *status) 


This subroutine finds the quadratic factor using 
multivariable Newton's method and also finds the 
reduced polynomial */ i 
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' 
float d,delu,delv,c[11]; 
int count, i; 


count = i; 
begin: 
Ъ[а+1] = а[п+1]; 
bin] = a(n) + u0*b[ni1]; 
for(isn-l;i>-1;i-~) 
b[i] = ali] + uO * Ь[4+1] + vO * b[is2]; 


с[п+1) = 0; 
с[п] = b[n«1]; 
for(i-n-l;is-1;i--]) 
Гі] = blitl] + uO * c[is1] + vO * c(i«2]; 


Я = с[2] * c[2] - с + cl3}; 

delu = -Ib(2] * с[2] - bl1] * c[3])/d; 
delv = -(bI1] * с[2] ~ bI2] * c[21)/d: 
*u = u0 + delu; 

*v = vO + delv; j 


if(fabs(delu/*u) <= && tabs(delv/*v) <- EPS) 
( 


*status = 1; 
return; 
) 


if(count « 100) 
{ 


vO = *v; 
count = count + 1; 
goto begin; 

) 

else 

( 
*status = 2; t 
return; 

) 

) 


/* End of subroutine bstow(] */ 
P c-—«—————— ез. */ 
/* Define the subroutine quad{) */ 


quad(float a,float b,float c,float *xl,float *x2,float 
*type! 


554 Numerical Methods 


/* "This subroutine solves a quadratic equation of type 
alx*x) + bx + c */ 


float а; 


q = bth - d*a'c; 


« 0.0) j* roots are comple: '/ 


w 


q)1/ (2*2); 


/* roots are real and steal */ 


*xl = -b/i2*8; 
0 = zi; 
‘type = «gv 


alae /* rocts ere seel and unequol */ 


? 
retur; 
) 
#* End cf subreutine madi} */ 


/* Defining output{) routine */ 
g 


output(int n, int type, float x-, 


/* This subroutine displays the roots of the quadratic 
equation */ 


printf("Wn*); 
printf("Reats of quadratic factor at n = за Wis, т) 
printt ("Mn"); 


if(type image) 


{ 
print’ (*Reoth 
prirtfi"zoot2 


Nor, xl, x2); 
а am, к] 


printf("Rootl = $f \п', xl); ; 
printf(^Root2 - 

) 

else /* Type == unequal */ 
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{ 
printf(*Rootl ='%f \n*, х1); 
printf ("Root2 = $f \n*, x2); 
) 


return; 


) 
/* End of subroutine output() */ 


* Main program ч 
* This program evaluates root of a polynomial using * 
уа Muller's method » 
s * 
* Functions invoked * 
* NIL 
аа a © 
* Subroutines used м 
* F(x) à 
(auum ide a decidat acria d Rie cR EE CE * 
* Variables used * 
"i х1,х2,х3 - initial values ui 
, f1,f2,f3 - function values at х1,х2,х3 » 
х a0,al,a2 - coefficients of quadratic polynomial * 
* hi - xi-x3 Ж 
" di - function difference fi-f3 Ф 
* Constants used ы 
ui EPS - Error bound па 
PS En eee ———— ВАЙ 


#include <math.h> 
#define EPS 0.000001 


main( ) 

( 
float F(float x): 
float x1,x2,x3,x4,£1,£2,£3,£4,h1,h2,d1,d2,a0,a1,a2,h; 
printf("AnlInput three initial points Wn"); 
Scanf(*&f ¥£ $f", &xl,&x2,&x3); 


fl = F(x1); 
f2 F(x2); 
f3 = Fix3); 
begin: 


hl = x1-3; 


d2 = £2-£3; 

/* Compute parameters a0,al,a2 */ 
ad = £3; 

al = (d2*hl*hl-dl*h2*h2)/(hl*h2* (n1-h2]); 


a2 (d1*h2-d2*h1] / (h*h2* (h1-h2]); 


/* Compute h */ 
if(a120.0) 


h-(-2.0*a0) /(al+sart (a1*al-4*a2*a0)); 
else 
he (-2.0*a0) / (a-sqrt (a1*a1-4*a2*a0)) ; 
/* Compute х4 and f4 */ 
х4 = x3+h: 
£4 = Р(х4); 
/* "est for accuracy */ 
if(f4esEPS) /* root obtained */ 
1 


ргіле# (“\п\пноот BY MULLER'S METHOD\n\n"); 
printf (“Root=%f\n", х4); 
printé("\n"); 

) 

else 

{ 


} 
/* End of main() program */ 
JO Bo eee Seren sen ocean M A зыш===з=ет=е. 


/* Detining the subroutine F(x) ”/ 
float F(float x) 


{ 
float f; 
f = x*xixe2*x*x410*x-20; 
return (f); D 
} 


/* End of subroutine () */ 
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Program 10 LEGI 
fs * 
* Main program * 
* This program solves a system of linear equations * 
* using simple Gaussian elimination method * 
* 
* Subroutines used # 
^ — GAUSSl * 
bni А anisi 
* Variables used м 
bi n - Number of equations in the system Ф 
+ a - Matrix of coefficients * 
* b - Right side vector * 
* x = Solution vector x 
£f 


int status,n,i,j; " 
float a[10][10], b[10], xI10]; 


printf("in SOLUTION BY SIMPLE GAUSS METHOD 1n"); 


printf("What is the size of the system (n)? Win"); 
ѕсапё(“%8°, &n); 


printf("Input coefficients а(і,ј), row-wise, Wn"); 
printf(*one row on each line. in*): 
Ғог(і=1;і<а 
for (ј=1;ј<= 
всапЁ(“%Ё”, Ба[1] (31): 
printf("AnInput vector b Xn"); 
for (i=l;i<=n;i++) 
Scan£("§f", &b[i]); 


/* obtain solution by simple Gauss elimination method */ 
/* call the subroutine gaussi() */ 
gaussi (n,a,b,x,&status); 


if(status != 0) 
t 
printf(*\nSOLUTION VECTOR X \n"); 
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for (1=1;і<=п;і»+) 
printf(*810.6f*, x{i}); 
printf (*\n"); 
1 
else 
( 
printf (“Singular matrix, reorder equations. Ха) 
} 
1 
/* End of main() program */ 


СУ) 


/* Defining subroutine gaussi() */ 


gaussl(int n, float a[10][10], float b[10), float x[10], 
int *status) 


/* This subroutine solves a set of n linear equations 
by Gauss elimination method */ 

( 
int i,j,k; 
float pivot, factor, sum; 

f наа Elimination begins -----.--.--. */ 


for (k=1;k<=n-1; k++) 
{ 
pivot = a[k][k]; 
if(pivort < 0.000001) 
{ 
*status = 0; 
return; 
) 
“status 
forti = 


factor = a[i][k] ‘pivot; 
for (j=k+1;j<=n;j++) 
{ 
alilij] = ali][j] - factor * alk] {j}; 


} 
bli] = bli] - factor + bik); 


fe enna Back substitution begins ------------- ay 


xin] = bin] / а[п}[п]; 
for (k=n-1;k>=1;k--) 
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sum = 0.0; 
for (j=k+1;j<=n; j++) 
sum = sum + a[k]I3] * х131; 
x(k] = (ЫК) - sum) / alk) к} 
) 
return; 


Program 11 LEG2 

4t 

Main program 
This program solves a system of linear equations * 
using Gaussian elimination with partial pivoting 


^ 
B 
Functions invoked " 
` 


Subroutines used 
Gauss2 


Variables used а = 
Number of equations 
a Coefficients matrix je 
b - Right side vector * 
x - Solution vector * 


* Constants used 
+ NIL * 


int 1.3.0; 
float a[10][10], bí10), x[10]; 


print£(*\n GAUSS METHOD WITH PARTIAL PIVOTING Wn*]; 


printf(*AnWhat is the size n of the system? Wn" 
scanf("td", &n); 
printf(*WnlInput coefficients a(i,j), row-wise \n" 
printf(*one row on each line WMn*); 
for(i 
for(j 
scanf(*$f*,&a[i]0j]1): 
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print£("\nEnter vector b Wn*]: 
for (i=1; i<=n;it+) 
всапЁ("%@"”, &b[i]); 
gauss2 (n,a,b,x); 
printf("Wn SOLUTION VECTOR X \n"); 
printf(*in*) 
Еог(1=1;1 i++) 
printf("Nc $f*, x[i]); 
print£(*\n"); 


p 
/* End of main() program */ 


/* Defining subroutine gauss2() */ 
gauss2(int n, float a[10][10],f!oat b[10], float x[10]) 


/* This subroutine solves a system of linear equations using 


Gauss elimination method with partial pivoting */ 


t 
/* Forward elimination */ 
elim(n,a,bl; 


/* Solution by back substitution */ 
bsubtn,a,b,x]; 


return; 
) /* Endot routine gauss2() */ 


/* Definim 


/* This subroutiae performs forward elimination 
incorporating partial pivoting technique */ 


й-1;Е++) 


pivot (n,a,b,k); 

for (i=k+1;i- pitt) 

{ 
factor = а[1][К} / alk] (Kl; 
for (j=k+1;j<=n; j++) 
t 
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afiJljl + afil{j]-factor * alk (jl; 
} 
blil = blil-factor*b[k]; 
} 
) 
return; 
) /* End of elimi) routine */ 


/* Defining subroutine pivot() */ 


# include emath.h» 
pivot(int n, float a[10][10], float b[10], int k) 


/* This subroutine performs the task of partial pivoting 
(reordering of equations) */ 
{ 
int Blade 
float large, temp; 


/* Find pivot p */ 
р = К; 
large = fabstalk][k]); 
for (irk+ljic=n;i++) 
{ 


if (fabs (afil{k])>Large) 

i 
large - fabs(a[i](k!); 
р = і; 


} 


/* Exchange rows p and к */ 
if(p!-k) 
{ 
for (jek;j<=n;j++) 
{ 
temp = a[plI3]; 
alp] [j] = alk} (il: 
alk] [j] = temp; 
) 
temp = blip]; 
Ыр) = ЫК]; 
bik) = temp; 
t 
return; 


) 


/* End of subroutine pivot() */ 
z 
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/* Defining subroutine bsub() */ 
bsub(int n, float a[10][10], float b[10], float x[101) 


/* This subroutine obtains the solution vector x by back 
substitution */ 


{ 
int i,j,k; 
float sum; 
xin] = bin] / а[п][п); 
fox (k=n-1;k>=1;k-~) 
{ 
sum = 0.0; 
for (jsk+1;j<=n; j++) 
sum = sum + alk] [j] * x(jl: 
x{k] = (b{k)-sum) / alk] ik]; 
) 


return; 

) 

/* End of subroutine bsub() */ 

18 зана dolo dii peg TINTS “/ 

Program 12 DOLIT 

ШИРИНИН ЕЕЕ . 
* Main program » 
i This program solves a system of linear equations * 
ы using Dolittle LU decomposition i 
. uode Posi epiac aoo M а * 
* Functions invoked * 
* NIL * 
ж а, a 
* Subroutines used e| 
* LUD, SOLVE * 
* 3s ————— B 
* Variables used * 
* n - System size * 
* a - Coefficient matrix of the system * 
* b - Right side vector "m 
(а 1 - Lower triangular matrix * 
* u - Upper triangular matrix M 
* fact - Factorization status Е 


S YES, NO ` 
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# define YES 1 
# define NO 0 


main( ) 
t 
int n, fact, i,j 
float a[10][10], u[10)[107. 11101110], b[10], x(10]; 


printf ("Nn SOLUTION BY DOLITTLE METHOD \п\п"); 


/* Read input data */ 
printf("\nwhat is size of A? \п"); 
scanf("$d", &n): 
printf (“Type coefficients a(i,j), row by row Mw); 
«-n;ie*) 
for (j=1;j<snij++) 
scanf(*$f', &e[illj]): 
printf("\nType vector b on one line \n*); 
for (i=l; i<sn;i++) 
scanf("$f", &b{i}); 
/* LU factorization */ 


lud(n,a,u,1,&fact); 


if(fact == YES) /* Print LU matrices */ 
{ 


/* Print U matrix */ 
print£("\nMATRIX U \n"); 
for(i=l;icsn;i++) 

{ 
for (j=l; j<=nij++) 


"am 
pNnt£(M15.6f*, uli) ijl); 
} \ 
print£(*\n"); 
} 
/* Print L matrix */ 
printf (*\nMATRIX L \n"); 
for (i=i;i<=n;i++) 
{ 
for (ј=1;ј<=п:ј++) 


printf(*$15.6f^, 111) (31); 


) 
printf(*Wn*); 
) 
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/* Solve for x */ 
solve(n,u,1,b,x) ; 
printf(*\nSOLUTION VECTOR X inin*); 
for(i-1;i«-n;ie«) 

printf("t15.6f Wn*, х[1]); 

printé(*\n"); 
1 
else 


{ 


printf(*An FACTORIZATION NOT POSSIBLE \n"); 


) 
) 
/* End of main() program */ 


BR. apap aiios po d ee 


/* Defining the subroutine lud() */ 


lud(int n, float a[10](10], float ul[10][10;, 


1[(10][10], int *fact) 


float 


/* This subroutine decomposes the matrix A into L and U 


matrices using DOLITTLE algorithm */ 


int i,j,k; 
float sum; 
/* Initialize U and L matrices */ 
for( pis=n;it+) 
for (j=1; jesa; j++) 
t 
v(ilti] = 0.0; 
1111051 = 0.0 


} 
/* Compute the elements of U and L */ 

for (js1;j<=n;3++) 

u[ilij] = а[11[3]; 
for(i=1;i<-n;i++) 

101111] = 1.0; 
ftor(i=l;i<=n;i++) 

1123011 = ану) и uli} (1); 


for (j=2; j<=n;j++) 
t 
for(i-2;i«-j;ie4) 
t 
sum = а[1] 15]: 
Ғог(К=1;К<=1-1;К++) 
sum = sum - lii]Ik] * u[k] [j]; 
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0127151 = sum 


} 
if(uljllil<=1.E-6) 
{ 
*fact = NO; 
return; 
} 
for(iejslpicenzin-) 
t 
sum = alillj 
for (K=1;k<s5-liker) 
sum = sum llij(k) * ulk) tji; 
1[41113] = sum / a( Gl: 
) 


1 
*fact = YES; 
return: 

) /^ Ens of subroutine ludí] 


ra: 


/* Defining the subroutine solve() */ 


solve(int п, float 1101110), float 1010) (101, float 
[10], float x[10]) 


/* This subroutine obtains the solution vector X using the 
coefficients of L and U matrices */ 
4 
int ij; 
float sum, 2110); 
|* Forward substitution */ 
2011 = bil; 
for (is2;i<-nsite) 


for(j-1:5 
sum sum + Milij) * 2131: 
2111 = bii] - sum; 
› 
/* Back substitution */ 
xin] = zln]/ulnj in]; 
for(i-n-1;i»-1;i--! 
{ 


sum = 0.0; 
for(jeisl;ije-n;j: н) 
sum = sum + u(i;[j] * x03); 
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} 
/* 


xli] - (z[i]-sum)/uli][i]; 
А 


return; 


End of subroutine solve() */ 


Main program * 
This program uses the subprogram JACOBI to solve * 

а system of equations by Jacobi iteration method * 

^ 

Functions invoked . 
NIL * 

acc RN + 
Subroutines used * 
JACOBI * 

E 

а - Coefficient matirx ` 

b - Right side vector * 

п - System size Ы 

X - Solution vector е 
count - Number of iterations completed * 
Status - Convergence status + 
Mn Pad мелана E бы Й 
Constants used е 
EPS - Error bound xi 

* 


MAXIT - Maximum iterations permitted 


#include <math,h> 
*define &PS 0.000001 
#define MAXIT 100 


main( ) 


{ 


int i,j,n,count,status; 
float a(10][10], Ь[10], x[10]; 


printf (*\n SOLUTION BY JACOBI ITERATION n*); 
printf("\nwWhat is the size n of the system? in"); 
scanf("*d*, &n); 
printf(*AnInput coefficients a(i,j), row by row \n*); 
for (i-1;i<=n;i++) 

for (j=1;j<=n; j++} 
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всапЁ(*%Ё", &alillil: 
printf (*\nInput vector bn); 
for(i ї<=п;ї++) - 

scanf("*f", &b[il): 


jacobiin,a,b,x. &eount, &status]; 


it(status--2) 


printf (*\nNo convergence in td iterations", MAXIT); 
printé("\n\n"); 


H 
else 
( 
printf ("\n SOLUTION VECTOR X nli 
for (i=1;is=niit++) 
printé£ (*815.6f", «lille 
printf (*\n\nIterations = за", count): 
) 


/* End of main() program */ 
ET 


/* Defining the routine jacobi() */ 


jacobi(int п, float al20) 110], float 0110), float x(i0], int 
*count, int *status) 


/* This subroutine solves а system of n linear equations using 
the Jacobi iteration method */ 
t 
int 1,},Кеу; 
float sum, x0[10]; 


/* Initial values of x */ 


for (i= pitt) 
x0li] = bli] / a(iltil: 
|* Jacobi iteration begins */ 
*count = 1: 
begin: 
key = 0; 


/* Computing values of x(i) є 
for(i-1;iec-n;i**) 


sum = bli]; 
Ғог(3=1:ј<=п;ј++) 
{ 


continue; 


Sum = sum - a[ij][j] * xQ[j]; 


$ 
x[i] = sum / ali} lij; 
if (key 0) 


{ 


/* Testing for accuracy */ 
if(fabs((x(i]-x01:1) /x(i]) 
key = 1; 


) 


/* Testing for convergence */ 
if (key += 1) 
it 
if(*count--MAXIT) 


*status 2; 


return; 
) 
else 
i 
“status = 1; 
for(i-l;is.n;i4.) 
ХОГ = xii 


) 
*count = *countil; 
goto begin; 
) 
return; 
} 
/* End of Subroutine jacobi() +7 


/*- 


Main program 
This program uses the subprogram GASEID to solve a 
System of equations by Gauss-Seidel iteration method 


+ 


NIL 


* Wok жа 


> FPS) 
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* gubroutines used Ы 
*  GASEID + 
— 2 * 
* Variables used * 
s a - Coefficient matrix * 
* b - Right side vector Є 
ы n - System size Е 
Ы x - Solution vector P 
% count - Number of iterations completed £ 
. status - Convergence status Ы 
. casar 
* Constants used * 
* EPS - Error bound bi 
# MAXIT - Maximum iterations permitted * 
аараан отба at EE ана */ 


# define MAXIT 50 
# define EPS 0.000001 


main( ) 

{ 
float a[10][10], b{10]l, x[101; 
int i,j,n,count,Status; 


printf ("\n SOLUTION BY GAUSS-SEIDEL ITERATION Nn*); 


print£("\nWhat is the size n of the system? Mn); 
scanf("$d", &n); 
printf("WnInput coefficients a(i,j), row by row \n"); 
printf(*one row on each line Wn"); 
for(i=l;i<=n;it+) 

for (j=1; i<en:5++) 

scanf (“$f*, &ali]l3]); 

printf(*MnInput vector b\n"); 
for(i=1;i<=n;i++) 

scanf("sf", &blil); 


gaseid(n,a,b,X. &count, &status); 


if (status==2) 


print£(*\nNo convergence in &d iterations Wn", MAXIT); 
printf (“\п\п") B 

1 

else 


printé(*\n SOLUTION VECTOR X \n\n"); 
for(i=1;i<=n;it+) 
printf("&15.6f^, х[11); 
print£(*\n\nIterations = $a", count); 
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) 
* End of main() program */ 


stj 


/* Defining the routine gaseid() */ 


gaseid(int n, float а{10][10), float b{l0], float x[10], int 
*count, int *status) 


/* This subroutine Solves a system of linear equations using 
the Gauss-Seidel iteration algorithm */ 


int i,j,key; 
float sum, x0[10]; 


/* Initial values of x *; 
for(isl;ican;iv+) 
x0lil = bli) / а[11[%); 


/* Gaseid iteration begins */ 
*count - 1; 
begin: 
key = 0; 
/* Computing values of xit у 
#ог{1-1;1<е=п;1++) 
( 
sum = b[ij; 
for (j-1; je-nz je) 
i 


ЇЁ(і==]) 

continue; 

Sum = sum - a[ij](j] * x0(j]; 
) 
X[i] = sum / ali) fi]; 
if(key--0) 


1 


/* Testing for accuracy */ 
if (fabs(Ix[i]-x0(1]) /xLi] ) -EPS) 
key - 1; 


) 


/* Testing for convergence */ 
if(key--1) 
1 
if(*count--MAXIT) 
t 
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‘status = 2; 


return; * 


Program terminated */ 
) 


else 


x0li] = x[i]; 


) 
*count = *count+l; 
goto begin; 

* 

return; 


* End of routine gaseid() */ 


* -Маіп program 


* (This program computes the interpolation value at a Ӯ 
Ы specified point, given а get of dala points, using * 
* the Lagrange interpolation representation * 
tnr oR Serta n 
* Punctions invoked ii 
* NIL $ 
"Ий НИНИН ш . 
* * 
* NIL s 
"ИИИНИН ТЕЕ ies scie acer * 
* * 
* A - Number of data sets > 
ж x(i) - Data points * 
* £(i) - Function values at data points Ы 
* xp - Point at which interpolation is required * 
i fp Interpolated value at XP * 
* it Lagrangian factor * 
КИ ER cpi ld * 
* Constants used * 


* MAX - Maximum number of data points permitted * 


&define MAX 10 


main( ) 


( 
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int n,i 
float x[MAX), f [MAX] , £p, 1f, sum, xp; 


printf("\nInput number of data points, n \п*); 
scanf(**d*, &n); 


print£("\nInpot data points x(i) and values fü) \n"); 
printf(*(one set in each line)\n") 
for(i=1;i<=n;i++) 

scanf("tf ФЕ", axli], &f[i]); 


printf£(*\nInput x at which interpolation 
is required. \п”); 
Scanf("&f*, &xp); 


sum - 0.0; 
for(isl;ie-n;is«) 
( 
lf = 1. 
for (j=1;j<=niji+) 
t 


if(it=i) 
1# = 1£*(xp-x[j]}/ (xli ]-x (31) ; 
) 
sum = sum + 1f * fti]; 
j 3 
fp = sum; 
printé(*\n LAGRANGIAN INTERPOLATION \n\n"); 


print{("Interpolated function value \n“); 
Printí("at x-$f is $f n*, xp, fp); 


} 
/* End of main() program */ 


* Main program 

* This program constructs the Newton interpolation 
polynomial for a given set of data points and then 

ы computes interpolation value at a specified value 


t... 


Functions invoked 
NIL 


^ 
* Subroutines used 
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* Variables used * 
* n - Number of data points * 
++ x - Array of independent даса points * 
* f - Array of function values . 
* xp - Desired point for interpolation * 
* fp - Interpolation value at XP * 
* £ - Difference table > 
* a - Array of coefficients of interpolation * 
* polynomial = 
epe * 
* 
* 

d 


int i,j,n; 
float xp, fp, sum, pi, x410}, #110], af10], d(10] [10]; 


printé£(*\nInput number of data points \п*); 
Scanf("&d*, &n); 

printf("\nInput values of x and f(x), M"); 
printf ("One set on each line Wn"); 


for(i=1;i<c=n;i++) 
scanf ("4E SE, &x[i], &flil); 7 
/* Construct difference table */ 
for(isl;i«-zn;ie*) 
911701) = #141: + 


for (ј=2;ј<=п;ј++) 
for (i=1;i jr; i++) 
ай] = (44+1113-11-41413 43) / бї+3-11-х{41); 


/* Set the coefficients of interpolation polynomial */ 
#ог(ј=1;ј<=п;3++) 
ati) = 9111031; 


/* Compute interpolation value */ 
printf (“\nīnput xp where interpolation is required ма”); 
scanf("*$f*, EXD); 
sum = allj; 
for (i=2;i<=njit+) 
И 
рї = 1.0; 
Ғот(ј=1;ј<=1-1;]++) 
pi = pi * (хр-х[5]); 
sum = sum + ali] * рі; 
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) 
fp - sum; 


/* Write results */ 
printf(*Xa*); 
printf(* NEWTON INTERPOLATION \п*); 
printf(*An*); 
printf(*Interpolated value \n"); 
printf(*at x = $f is $f \n", xp, fp); 


) 
/* End of main() program */ 


* Main program * 
* This program computes the interpolation value at * 
+ а specified value, given a set of table Points, * 
ш using the natural cubic spline interpolation * 


* Functions invoked 


Subroutines used * 
GAUSS . 


Variables used m 
- Number of data points. » 
- N by 1 array of data points. * 
7 N by 1 array of function values nd 


n 
x 

f 

xp - Point at which interpolation is required 
fp - Interpolation value at XP 

4 - Array of second derivatives (N-2 by 1) * 
d - Array representing (N-2 by 1) 

с 


- Matrix (N-2 by N-2) representing the * 
coefficients of second derivatives * 

h - Array of distances between data points + 
(h(i) = x(i) - x(i-1) ) * 

df - Array of differences of functions * 


Constants used 
» MAX Maximum number of table points permitted * 


fdefine з 10 


main( ) 
t 
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int i,j,n,m; 
float x([Sl],f[S!,aIS] , d(S], c [S] [S], h[S] , d£ [S] ,u[S], 
хр, #р,91,92,93; 


/* Reading input data */ 


printf£("\nInput number of data points, n \n*); 
scanf("td", &n); 


printf("\nInput data points x(i) and function \n* 
printf(*values f(i), one set on each line Wn*); 
#ог(1=1;1<=п;1++) 

scanf(*&f &f^, &x[i],&f[i]); 


printf£("\nInput point of interpolation Vn"); 
scanf("&f^', &хр); 
/* Compute distances between data points and function 
differences */ 


for(is2;i<=n;i++) 
{ 
h(i] = x[i] - xli-1]; 
df(i] = £4] - f[i-1]; 
n > 
/* Initialize C matrix */ 
i<en-1 
j<=n-1; j++) 
[4113] = 0.0; 


/* Compute diagonal elements of c */ 
for(i-2;i1«-n-1;i«*) 
c[illi] = 2.0 * (h[i] - h(i*1]); 


/* Compute off-diagonal elements of c */ 
for(i-3;i«-n-1;i**) 
{ 
e(i-11] fi} = h(il: 
cfil(i-1] = hli]; 
) 
/* Compute elements of d array */ 
#ог(ї=2;1<=п-1;1++) 
afi] = (dflisi]/h(i-1] - df[i]/hti]) * 6.0; 


/* Compute elements of a using Gaussian elimination. 
Change array subscripts from 2 to n-l to 1 to n-2 
before calling gauss() subroutine. */ 

m - n-2; 
for (i=1; i<=m;i++) 


t 
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d(i) = 4[1+1]; 
for(j=1;j<=m; j++) 
с(11131 = с{1+1][3+1]); 
) 


gauss(m,c,d,a) ; 2 


/* Compute the coefficients of natural cubic spline */ 
for (isn-1;i>=2;i--) 
ali] = a[i-1]; 
a[i] = 0.0; 
a[n) = 0.0; 
/* Locate the domain of xp */ 
for(i=2;i<=n;i++) 
{ 
if(xp <= х[1]) 
break; 
) 


/* Compute interpolation value at xp */ 
u[i-1] = xp - х[1-1]; 


uli] = xp - xfil; 

ql = hli]*h(i] * uli} - uli]*ulil*u[i]; 

92 = ufi-1]*uli-l]*u[i-1] - h[i]*h(i] * u[i-1]; 
q3 = fti]*uli-1] - fli-1]*ulil: 


fp = (а[1-1]*91 + a[i]*q2)/(6.0 * h[i]) + q3/hlil; 


* Write results */ 
printf("in SPLINE INTERPOLATION in \n"); 
printf (“Interpolation value = $f \n*, fp); 

) 

/* End of main() program */ 


M icio di cdd a rae E бед: +) 
/* Defining gauss() subroutine */ 
gauss(int n, float a[10)[10], float b[10], float x[10]) 


/* This subroutine solves a set of n linear equations using 
Gauss elimination method */ 


int i,j,k; 
float pivot, factor, sum; 
/* ----------- Elimination begins -— 
for(k-l;k«-n-l;ks*) 
t 
pivot = a(k] [k]; 
for(i=k+1;i<=n;i++) 
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t 
factor = а[і][х] / pivot; 
for(j=k+1;j<=n;i++) 
а[11131 = alil(j)] - factor * alk] [i]; 
bli] = bli] - “factor * bik}; 
} 


ў 


/* Back substitution begins */ 
xin] = bin]/aInl [n]; 
for (k=n-1;k>=1;k—) 
t 
sum = 0.0; 
for (j=k+1; jeen; j++) 
sum = sum + a[k](j] * x[il; 
x(k] = (b[k]-sum) /a(k) [X]: 
} 
return; 
} 


/* End of subroutine gauss() */ 


ji эзерли ea a eee ЇЙ 


Program 18 LINREG 


ҮТ —————— See желк 


* Main program 
This program fits a line Y = A + BX to a given 


Functions invoked 
Library function fabs() 
Subroutines used 
NIL 
Variables used 
X,y - Data arrays 
n Number of data sets 
sumx - Sum of x values 
sumy - Sum of y values 
sumxx - Sum of squares of x values 
sumty - Sum of products of x and y 
xmean - Mean of x values 
ymean - Mean of y values 
a - y intercept of the line 
b - Slope of the line 


&OW o» € * X* * » * 9*9 **»*** 
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* Constants used 
* MAX - Limit for number of data points 
* EPS - Error limit 


TEF, 


#include <math.h> 
#define MAX 10 
Kdefine EPS 0.000001 


main( | 
( 
int 1,n; 
float x[10], yl10]; 
float sumx, sumy, Sumxx, Sumxy, Xmean, ymean, 
denom, a, b; . 
printf(*in LINEAR REGRESSION \n \л*); 


/* Reading data values */ 
printf("ininput number of data points, n \n"); 
scanf("&d", &n); 
print£("\nInput x and y values Mn"); 
printf(*One set on each line in"); 
for(i=l;i<=n;irr) 
scanf("sf tf", &x[i], &yli]); 


/* Computing constants a and b of the linear equation */ 
sumx = 0.0; 


e) 


sumx = sumx + x[i]: 
sumy = sumy + y[i]; 
sumxx = sumxx + x[i] * xlil; 
sumxy = sumxy + xli] * yli); 
Li 
xmean = sumx/n; 
ymean = sumy/n; 
denom = n*sumxx - sumx * sumx; 
if(fabs(denom) > EPS) 
i 
b = (n*sumxy - sumx * sumy)/derom; 
а = ymean - b * xmean; 
printf£("\n LINEAR REGRESSION LINE y = athx \n"); 


printf("\nThe coefficients are: \n"); 
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printf(*a = $f \n", а); 
printf(*b = %£ \п", b); 
) 
else 
( 
print£(*in NO SOLUTION \n*); 
) 


) 
/* End of main() program */ 


* Main program e 
ы This program fits a polynomial curve to a given * 
* Set of data points by the method of * 
* least squares à 
* ` 
* Functions invoked » 
* NTL E 
S apium L———————— аА м 
* Subroutines used Ы 
* normal, gauss jd 
* * 
* Variables used i" 
* х,у - Arrays of data values + 
- n - Number of data points * 
Ы mp - Order of the polynomial under construction * 
ж m - Number of polynomial coefficients * 
i c - Coefficient matrix of normal equations Ж 
ы b - Right side vector of normal equations ө 
* а - Array of coefficients of the polynomial * 

ЕС 
* Constants used ы 
* MAX — Maximum number of data points e 


#include <math.h> 
#define MAX 10 


main( ) 
{ 
int n,mp,m,i; 
float x[MAX],y MAX], c [MAX] [MAX] ,a [MAX] , b[MAX]; 


printf("An POLYNOMIAL REGRESSION inWn*); 
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/* Reading values */ 
print£(*Input number of data points n \п"); 
scanf(**d*,&n); 
printf("Input order of polynomial, шр required \n"); 
scanf (*&d* , amp) ; 
printf(*Input data values, x and y \n"); 
printf(*one set on each line in"); 
for(i-1;i ;1++) 
scanf(*&£ ФЕ", &х[1], &ylil); 


/* Testing the order */ 
if(n <= mp) 
( 
printf(*Xa REGRESSION 15 NOT POSSIBLE \n”); 
goto stop; 


) 

/* Number of polynomial coefficients */ 
m= пр + 1; 

/* Computation of elements of c and b "/ 
normal (x, y, c,b,n,m) ; 


/* Computation of coefficients а(1) to а(т) */ 
gauss (m,c,b,a) ; 


/* Printing of coefficients а(і) */ 
printf (“\n POLYNOMIAL COEFFICIENTS \n\n"); 
for (i=1;i<=m;itt) 
printf(*&15.6f*, alil); 
printf(*Wn*); 
stop: 
printf ("END"); 


/* End of main() program */ 


ye - 

/* Defining the subroutine normal() */ 

normal(float x[MAX], float y[MAX], float c[MAX] [MAX], 
float b[MAX], int n, int m) 


/* This subroutine computes the coefficients of normal 
equations */ 
{ 
int i,j,k,11,12; 


for (j=1;j<=m;j++) 


pk<=m;k++) 
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БЕШ 
11 = ke 
for(i-1;i«-n;i«.«) 

CIjHK] = cLillk] + pow(xLi],11); 


H 


} 
for (j=l; je=m;j++) 
{ 

bij] = 0.0; 

12 = j-1; 


for (1=91;1<=п;1++) 
Ъ{31 = bijleylil * ром(х[11,12); 
) 
return; 
) 
/* End of subroutine normal() */ 


disc a btn anemones, lyf 


/* Defining gauss() subroutine */ 
gauss(int n, float a[MAX] [MAX], float b[MAX], float x|MAX]) 


/* "his subroutine solves a set of n linear equations by Gauss 
Elimination method */ 
{ 
int i,j,k: 
float pivot, factor, sum; 


/* Elimination begins */ 
for (k=1;k<=n-1;k+4) 


{ 
Pivot = alk] [k]; 
for{i-k+l;icen;i++) 
i 
factor = afi] /pivot; 
for(j-k«1; j«en 
а[11131 1 - factor * а[к) [jl; 
b[i] = bli] - factor * ЫК]; 
} 
} 


/* Back substitution begins */ 
x[n] = bí[n]/a[n] [n] ; 
-l;k»-1;k--) 


sum = 0.0; 
for(j-k«1;je-n;js«) 
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sum = sum + alkl(j] * хЇ3}; 
xik] = (b[k]-sum)/a[k] (к) ғ 
) 
return; 
) 
/* End of subroutine gauss() */ 


This program computes the derivative of а tabulated 
function at a specified value using the Newton 
interpolation approach 


Subroutines used 
NLL 


pid BREST eS 


. 


Variables used 
n - Number of function values given 


* 
* x - Array of values 

bd £ - Array of function values 

* a - Array of coefficients of Newton polynomial 
* d - Difference table 

Ы xp - Desired point of differentiation 

E dif - Derivative of the function at XP 


за Mah PSR E 


int і,ј, к,п; 
float x[10],£(10],a(10] ,d{10] [10] ,xp, dif, sum, p; 


/* Reading input data */ 
print£(*\nInput number of data points \n”); 
scanf(**d*, &n!; 
printf£("\nInput values of x and f(x), An"); 
printf(*one set on each line \n"); 
for (i=1;i<=nzi++) 
scanf ("sf ЗЕ", &х[1], &flil); 
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/* Constructing difference table d */ 
for(i=1;i<=n;i++) 
d(i][1] = f(il; 
for (j=2; j<=n; j++) 
for (i=1;i<en-j+1;it+) 
4(11{31 = (а[1+1]113-1] - atli] (j-1))/(x[i+j-1] - 


х{4]); 


/* Set the coefficients of Newton interpolating polynomial */ 

for (j=1;j<=n; j++) 
alj) = alil (j); 

/* Compute derivative at a given x = xp */ 
printf("AnInput xp where derivative is required \n"); 
Scanf(**f*, &xp); 
dif = a[2]; 
for (k=3; k<=n; k++) 

( 
sum = 0.0; 
for (i=1; i<=k-1; i++) 
1 
р = 1.0; 
for(j=1;j<=k~1; j++) 
t 


if(i == j) 
continue; 
р= р * (xp ~ х[3]); 


} 
sum = sum + p; 
} 
dif = dif + alk] * sum; 
} 
/* Write results */ 
printf£(*\nNUMERICAL DIFFERENTIATION USING *); 
Printf(*NEWTON POLYNOMIAL inWn*); 
printf (“DERIVATIVE at x = &f is àf Mn*, xp,dif); 


/* End of main() program */ 


Program 21 TRAPE1 
6 esce а 


* Main program 


* 
This program integrates a given function ы 
using the trapezoidal rule * 
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E 


+. 


ETETE RES 


Functions invoked 


Subroutines used 


P(x) 


Variables used 
a - Lower limit of integration 
b - Upper limit of integration 
h - Segment width 
n - Number of segments 
ict - Value of integral 


Constants used 


#include «math.h» 


main( } 


{ 


int n,i; 
float a,b,h,sum, ict; 
float F(float x): 


printf (*\nGive initial value of x \n"); 
scanf("t£*,&ka); 
print£(*\nGive final value of x Wn"); 
scanf(*&f', &b); 
printf("NnWhat is the segment width? An"): 
scanf("$f*, &h); 
n = (b-a)/h: 
sum = (F(a) + F(b))/2.0; 
for (i=1;i<en-1;it+) 
t 
sum = sum + Е(а+і*ћ); 


) 
ict - sum * h; 


printf (*\n”); 

printf ("Integration between $f and $f \n*, a,b); 
printf(*When h - ФЕ is ЖЕ An^, h, ict); 
printf(*Wn*); 


End of main() program */ 
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float F(float x) " 


f 
float f; 
f = 1.0-exp(-x/2.0); 
return (f); 
) 
/* End of subroutine F(x) */ 
T T PEIPER “/ 
Program 22 SIMS1 
T laii 2-7 a NI eS ы ы, 6 
* Main program rd 
Ы This program integrates a given function * 
. + 
E * 
* Functions invoked * 
* — Macro F(x) * 
. ^ 
B ^ 
» NIL ^ 
MN айырба р icc ie qt E. * 
* Variables used ч 
* а - Lower limit of integration es 
m b - Upper limit of integration Ы 
" h - Segnent width s 
" n - Number of segments а 
* ics - Value of the integral * 
* * 
* NIL а 
Osa et ee ine a РУ; 


#include <math.h> 
Wdefine Р(х) 1 - exp(-(x)/2.0) 


main( ) 
t 
int n,m i; 
float a,b, h, sum, ics, x, f1, f2, f3; 


printf("\nInitial value of x Mnt); 

Scanf("*&f^, ба); 

printf£("\nFinal value of x in*); 

scanf(*$f*, &b); 

printf("\nNumber of segments (EVEN number) Wn*); 
scanf("$d*, &п); 
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h = (b-a)/n; 
m= n/2 
sum = 0.0, 
х = 8i 
£l = Fix): 
for(i=l;i<=miirt) 
t 

#2 = F(x*h); 


£3 = Р(х+2*'); 
sum = sum + fl + A*£2 + £3; 
£1 = £3; 
x = х + 2" 
) 
ics = sum * h/3.0; 
printf (*\nIntegral from $f to ЪЁ М", a,b): 
printf ("When h = $f is ФЕ \п", n, ics): 


y* End of main() program ж; 


[+ энене = m . 
* Main program * 
* This program performs Romberg integration ы 
* ру bisecting the intervals N times * 
"s aa " 
+ punctions invoked 
* Macro F(x), Library function fabs() 


* 
* 

* 

` 

. 

* 

- Starting point of the interval ж 
D 

* 

* 

* 

* 

` 

* 


* variables used 

, a 

* p - End point of the interval 

id h - Width of the interval 

* n - Number of times bisection is done 
= m - Number of trapezoids 

* r - Matrix of Romberg integral values 
* Constants used 

* EPS - Error bound 
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finclude «math.h» ' 
*define EPS 0.000001 
#define Р(х) 1.0/(x) 


main( ) 
{ 
int i,j,k,m,n; 
float a,b,h,sum,x,r[10][10]; 


printf(*VXnInput end points of the interval \n"); 
scanf("$f ФЕ", &a,&b); 

printf("MnlInput maximum number of times"); 
printf("\nthe subintervals are bisected \n"); 
scanf("$d', &n); 


/* Compute area using entire interval as one trapezoidal */ 
h=b - a; 
ї[1){1] = h * (F(a)*F(b))/2.0; 
printf(*Yn$15.6f An*, r[1][1]); 


/* Process of Romberg integration begins */ 
for(i=2;i<=n+1;i++) 
t 
m = pow(2,(i-2)); /* trapezoidal for ith refinement */ 


h - h/2; /* bisect step-size */ 

/* Use recursive trapezoidal rule for m strips */ 
sum = 0.0; 
for(k-l;k«-m;ke«) 


i 
X = a + (2*k-1)*h; 
sum = sum + Р(х); 
) 
rli][i] = r[i-1][1]/2.0 + h*sum; 


/* Compute Richardson's improvements */ 
for(j=2;j<=i; j++) 
it 
rli] (9) fil (5-2 + (rti) (5-1) oc i1) (3-11) / (owl, j-1) -1) + 


/* Write results of improvements for ith refinement */ 
for (j=1;j<=i;j++) 
printf(*$15.6f*, riilljl); 
print£("\n*); 


/* Test for accuracy */ 
if(fabs(r[i-1][i-1] - rlil][i]) < EPS) 
/* Stop further refinement */ 
í 
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printf(**n*); 
printf (*ROMBERG INTEGRATION = ФЕ \n", r(ilHil)g 
printf ("n"); 
goto stop; 
1 
else 


{ 


continue; 


) 
) 


/* Write final result */ 
print£(*NnROMBERG INTEGRATION = ФЕ va^, к{п+1}[п+1)); 
printf(*(Normal exit from loop) Wn); 
stop: 
printf ("End"); 
) 


/* Bnd of main() program */ 


КИНИНИ ЕЕ 
Program 24 TRAPE2 
Je ааыа anima nS Cibi a 


* Main program 
+ — This program integrates a tabulated function 
i using the trapezoidal rule 


Functions invoked 
Library function fabs() 


ы “gubrouti nes used 


* NIL 

È cited des ptam aS B 
* Variables used Б 
* n- Number of table points * 
y x- Array of independent data points $ 
* y- Array of function values * 
* a- Lower limit of integration * 


b- Upper limit of integration 
h- Distance between points 

B ni- Position of A in the table 
* n2- Position of B in the table ж 
*  ict- Value of integral 


P NIL 
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#include <math.h> 
#define MAX 15 


main( ) 
{ 
int n,nl,n2,i; 
float a,b,h,sum, ict,x[MAX] , y MAX] ; 


/* Reading table values */ 
printf(*Number of data points \n”); 
scanf("*$d", &n); 
printf("\nInput table values, set by set in"); 
for(i=l;i<=n;i++) 
scanf ("8f $f", &x[il,&ylil): 


/* Reading the limits of integration */ 
printf("Initial value of x in"); 
scanf("tf^,&a); 
printf("Final value of x \л*); 
scanf("$f*, &b); 
printf(*MnWhat is segment width? \n*); 
scanf (“%$f", &h); 


/* Computing the position of initial and final values */ 
nl = (їпє)(#айв(а-х[1))/һ+1.5); 
n2 = (int) (fabs (b-x[1})/h+1.5); 


/* Evaluating the integral */ 


і<=п2-1;1++) 
sum = sum + y[i] + у[ї+1]; 
ict = sum * h/2.0; 
printf("AnIntegral from &f to $f is $fAn*, a,b, ict); 
) 
/* End of main() program */ 


n qe UEM tr acto gem E E е Пана T 
Program 25 SIMS2 
Ж "ag cian phe te he eee ate + 


* Main program 

* This program integrates a tabulated function 
i using the Simpson's 1/3 rule.If the number of 
* segments is odd, the trapezoidal rule is used 
^ for the last segment. 

* Functions invoked * 
joi Library function fabs() 
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* Subroutines used 


* NIL * 
diem O€———— ` 
* Variables used * 
s n- Number of table points * 
> x- Array of inđependent values ¥ 
" y- Array of function values * 
* a- Lower limit of integration 


b- Upper limit of integration 

h- Distance between independent values 

п1- Position of A in the table 

Position of B in the table 

il- Area computed using Simpson's rule 

i2- Area of the last segment (by trapezoidal rule) 
Value of integral 


resto 
-H 
ю 
' 


^ 
* Constants used 
* 


eee eee n ow ow ow o9 


NIL 


Winclude «math.h» 
main( ) 
{ 
int i,n,n1,n2,m,1; 
float x[15],y[15],a,b,h, sum, ics, i1,i2; 


/* Reading table values */ 
printf("Input number of data points An"); 
scanf("Med^, &n); 
printf("\nInput table values, set by set \n'); 
for (i=1;i<=n;it+) 
scanf ("sf f", &x[i],&ylil); 


/* Reading the limits of integration */ 
printf(*Input initial value of х Wn*); 
scanf (*%f*, Ба); 
printf ("Input final value of x \n"); 
scanf("$f", &b); 
printf(*What is segment width? Wn*); 
scanf(*$f*, &h); 


/* Computing the position of initial and final values */ 
nl = (int) (fabs(a-x[1])/h*1.5); 
n2 = (int) (fabs(b-x[1])/h*1.5); 


/* Testing for even intervals */, 
m - n2 - ni; 
if(m$2 == 0) /* m is even */ 
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( 
i2 = 0.0; ! 
152-2: ; 


else /* m is odd */ 

Ц 

/* Use trapezoidal rule for the last strip */ 
12 = (y[n2-1]*y[n2]) *h/2.0; 
1 =й = З; 

1 

/* Use Simpson's rule for 1 strips */ 

sum = 0.0; 

Ғог(і=п1;і<=1;1=1+2) 
sum = sum + yli] + 4*yliel] + у[1+2); 

il = sum*h/3.0; 


/* Integral is sum of il and i2 */ 
ics = il + 12; 


/* Writing the results */ 
printf£(*\uIntegral from &f to ФЕ is &f An^, a,b,ics); 


/* End of main() program */ 


* Main program * 
v ‘his program estimates the solution of the first * 
* order differential equation у’ = f(x,y) at a given * 
* point using Euler's method i 
+ . 
* Functions invoked Ы 
^ NIL * 
een — —— | 
* Subroutines used * 
* * 
* Variables used T 
ы x- Initial value of independent variable x 
s y- Initial value of dependent variable * 
* xp- Point of solution jd 
* h- Incremental step-size E 
* n- Number of computational steps required * 
* dy - Incremental Y in each step Là 
* * 
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* Constants used 


ө NIL 
es T ————————— ‚/ 
main( ) 
£ 
int i,n; 
float х,у, xp, h, ду; 
float func(float, float); 
printé("\n SOLUTION BY EULER'S METHOD (n \n"); 
/* Reading initial data "E 


printf (“Input initial values of x and y \n"); 
scanf (“$f $f*, &x,&y); 

printf (“Input x at which y is required in"); 
scanf(*&f^, &xp); 

printf(*Input step-size, h Wn*); 

scanf(*$f*, &h); 


/* Compute number of steps required */ 
n = (int)((xp-x) /h«0.5] ; 


/* Compute y recursively at each step */ 
for(i-1;ic-n;ie«] 


( 

ду = h * fune(x,y); 
x-x*h 

y= y4 yr 


printf("$5d $10.6f $10.6f \п", i,x,y); 
) 


/* Write the final result */ 
printf("AnValue of y at x = $f is %# \n", х,у); 


/* End of main() program */ 


Га 
/* Defining subroutine func() */ 


float func(float x, float у) 


t 
float f; 
f = 2.0 * y/x? 
return(f); 

} 


/* End of subroutine func() */ П 


Program 27 HEUN 


y* 
* 
* 


^ 


n 


"n 


Main program * 
This program solves the first order differential * 
equation у’ = f(x,y) using the Heun's method + 


Functions invoked Ы 
NIL * 
i ea Go Hn i A EMI de Ee E pia aT E 
Subroutines used id 
func(] m 
PNIS EM 1——— Pn * 
Variables used * 
x - Initial value of independent variable Ы 
у - Initial value of dependent variable * 
Xp - Point of solution * 
h - Step-size * 
п Number of steps * 
omes енемен йде нганы рад аы нырак * 
Constants used * 


NIL ^ 


int i,n; 
float x,y,xp,h,mi,m2; 
float func(float, float); 


printf (“\n SOLUTION BY HEUN'S METHOD An in"); 


Reading initial data */ 
printf (“Input initial values of x and y \n"); 
scanf ("$f ФЕ", &x,&y); 
printf(*Input x at which y is required \п”); 
scanf(*$f*, &xp); 
printf (“Input step-size, h n*); 
scanf(*$f*, ьп); 


Compute number of steps required */ 
n = (int) ((xp-x) /h+0.5); 


Compute y recursively at each step */ 
for(i=1;i<=n;i++) 
{ 
ml = fune(x,y); 
m2 = fune(x+h,y+mi*h) ; 
x= x + hy 
у = у + 0.5*h* (n1«a2) ; 
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Га 


) 
n 


ye 


printf ("85d $10.6f %10.6f Ant, 1,х,у); 


) 
write the final resuit */ 


printf("AnValue of y at x = $f is ef \п", х,у); 


End of main() program */ 


Defining subroutine funcl) */ 


float func(float x, float у) 


( 


› 
ye 


float E; 
£ = 2.0 * ү/х; 
return(f); 


End of subroutine func() */ 


Program 28 POLYGN 


"n 
" 
D 


Functions invoked 
NIL 


Subroutines us 
func() 


Variables used 
x - Initial value of the independent variable 
y - Initial value of the dependent variable 
xp- Point of solution 
п - Incremental step-size 
n - Number of computational steps required 


NIL 


int i,n; 
float x,y,xp,h,mi,m2; 
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float func(float, float); 

printf("Mn SOLUTION BY POLYGON METHOD An Yn*); 
/* Reading initial data */ 

printf("Input initial values of x and y Wn*); 

scanf ("$f tf, &x,&y): 

printf(^input x at which y is required \п”); 

scanf(**f', &xp); 

printf("Input step-size, h \n"); 

scanf("*f^, ЕН) г 


/* Compute number of steps required */ 
п = (int) ((xp-x)/h«0.5); 


/* Compute y recursively at each step */ 
for(isl;issn;i++) 


{ 
mi = funcix,y): 
m2 = ®©шпе(х+0.5*п,у+0.5*һ*т1); 
x= x + h; 


y =у mh; 
ргїпє#(*®5@ $10.6f $10.6f Wn", i,x,y); 


t 


/* Write the final result '/ 
printf("\nvalue of y at x = &f is tf \n", х,у); 


) 
/* тла of main() program */ 


/* Defining subroutine func() */ 
float func(float x, float y) 
t 


float f; 
£ = 2.0 * y/xi 
return (f); 


* Main program " 
* This program computes the solution of first order Ы 
$ differential equation of type y' = f(x,y) using * 
* the 4th order Runge-Kutta method T 
* 
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* Functions invoked 


y func() 

HE ———X———————MM * 
* Variables used » 
ж x- Initial value of independent variable © 
+ y- Initial value of dependent variable i 
. xp- Point of solution . 
* h- Step-size А 
` n- Number of steps * 


int i,n; 
float x,y,xp,h,ml,m2,m3,m4; 
float func(float, float): 


printf("in SOLUTION BY 4th ORDER RK METHOD inn"); 


/* Input initial data */ 
print£(“input initial values of x and y Wn"); 
scant (“AË &f*, ах, ку): 
printf("Input x at which y is required in"); 
scánf("&f", &xp); 
printf("Input step-size, h in*); 
scanf("*$£^, &h); 


/* Compute number of steps required */ 
n = lint) ((хр-х) /h«0.5) r 


/* Compute y at each step */ 
printf(*Nn*); 
т зе A есы ыссы eee So Mnt); 
printf(* STEP х * An"); 
printf(* --- 
#ог(ї=1;1<=п;1 ++) 


ml= fune(x,y); 

т2 = fune(x+0,5*h, у+0.5*т1*һ); 
m3- func{x+0.5*h, y+0.5*m2*h); 
тА = func(x+h, y+m3*h); 
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х= x+h; 
у= у + {т1+2.0*т2 + 2.0*m3 + m4) * h/6.0; 
print£("$5d 915.6f %15.6Ё \n", i,x,y); 
} 
р: ха а”); 


/* Write the final value of у */ 
printf("AnValue of y at х = $f is $f \n", х,у); 


ї 
/* End of main() program */ 


ЙУ] senha irc ia ac asi +] 
/* Defining subroutine func() */ 


float func(float x, float у) 
( 

float f; 

Е = 2.0 * y/x; 

return (f); 


/* End of subroutine func() */ 


* Main program * 
This program solves the first order differential * 
equation y' = £(x,y) using Milne-Simpson method * 
uum + 

* 


* Functions invoked 
d NIL * 


* Subroutines used £ 
ud func() - 
* . 
* Variables used Б 
d x(1)- Initial value of independent variable * 
* y(1) - Initial value of dependent variable * 
Д xp- Point of solution * 
е n- Number of steps * 
е h- Step-size bi 
* x- Array of independent variable т 
* у- Array of dependent variable Е 


бя» 


int i,n; 
float h,x[15],v[15],xp,ml,m2,m3,m4, f2, £3, £4, f5; 
float func(float, float); 


/* Reading initial values */ 
printf("Input initial values of x and y Wn"); 
scanf (“$f ФЕ", &x(l],&y(11): 
printf ("Input value of x at which у is required in^); 
scanf(*$f*, &xp); 
printf(*Input step-size, h \n*); 
scanf("&f*, &h); 


/* Compute number of computation required */ 
n = (int) ((хр-х[11) /h«0,5) ; 


/* We require four starting points for Milne-Simpson 
method. 
Initial 

values form the first point. Remaining three points are 
obtained 

using 4th order RK method */ 


printf(*NnINITIAL VALUES ARE $10.6f $10,6£", x[1],y[1]); 


/* Computing three more points by RK method */ 
print£(*\n\nTHREE VALUES BY RK METHOD \п*); 
Ёог(1=1;1<=3;1++) 

{ 

ml = fune(x(i],ylil): 

m2 = func(x[i]*0.5*h, y{il+0.5*ml*h); 

m3 = func(x[i]+0.5*%h, у[1]+0.5*т2*һ); 

må = Func(xfi}+h, уназ) 

x[i*1] = x[i] + h; 

y[i+1] = yli] + (ml-2.0*m2 + 2.0*m3 + m4)*h/6.0; 

printf(*An$5d $10.6f $10.6f*, i, x[icl],ylie1]); 
) 


/* Computing values by Milne-Simpson method */ 
printf ("\n\nVALUES OBTAINED BY MILNE-SIMPSON METHOD Wn"); 
for(i=4;i<=n;ir+) 


{ 


#2 = func(x|i-2], yli-2]); 
£3 = fune(x{i-1],. у{1-1)); 
f4 = func(xli], у[1]); 


/* predicted value of y (by Milne's formula)*/ 
yli*1] = у[1-3] + 4.0*h/3.0 * (2.0*£2-£3+2,0*£4); 

х[1+1) = x[i] + h; 

#5 = func(x[i*1], у[ї+1)); 
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/* Corrected value of v (by Simpson's formula) */ 
yli+1] = yli-1]4h/3.0 * (£3+4.0*£4+£5); 
printf (“\п%50 $10.6f $10.6f*, i, х{ї+1], у[1+1)); 
1 
print£("\n\nvalue of y at x -.$f is $f", х(п+1], у[п+1]); 
) 
/* End of main() program */ 


/* Defining subroutine func()*/ 


float func(float x, float y) 
( 

float'f; 

fo. ЫЕ * age 

+ return(£); 


) 
/* End of subroutine funcíj */ 
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